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Title: SARS 

5 

The invention relates to the field of virology, more in particular to a new 
coronavirus. In particular sequences encoding (parts of) viral proteins are provided. 
Further, the invention relates to diagnostic means and methods, prophylactic means 
and methods and therapeutic means and methods to be employed in the diagnosis, 

10 prevention and/or treatment of disease, in particular of respiratory disease (atypical 
pneumonia), in particular of mammals, more in particular in humans. In another 
embodiment the invention relates to the use of interferon, preferably pegylated 
interferon for the prophylactic or therapeutic treatment of animals, preferably 
vertebrates, more preferably birds or mammals, especially human, apes or rodents, 

15 infected with a coronavirus, more specifically an animal, preferably human infected 
with a SAES associated coronavirus (SAES-CqV). 

Recently, a new virus has caused a global health risk because of its pathogenic 
effects in man combined with a relatively easy droplet transmission. The virus first 
2 0 was seen in the Chinese province Guangdong, was spread to Hong Kong in February 
2003, and within two months it has been able to spread to several countries all over 
the world where it has caused 78 deaths out of 2300 people infected (New Scientist 
Online News 13:25 02 April 2003). The virus has been named SARS (Severe Acute 
Respiratory Syndrome) virus and causes a respiratory illness (atypical pneumonia) in 

2 5 man. This illness usually begins with a fever, sometimes associated with chills or 

other symptoms, including headache, rash, diarrhea, a general feeling of discomfort 
(malaise) and body aches. Some people also experience mild respiratory syndromes at 
the outset. 

After 2 to 7 days, SAES patients may develop a dry, nonproductive cough that 

3 0 might be accompanied or progress to the point where insufficient oxygen is getting 

to the blood, visible as shortness of breath. In 10% to 20% of the cases, patients will 
require mechanical ventilation, and eventually the disease can lead to the death of 
the patient. Hospital personnel, children, elderly and people having an underlying 
condition such as diabetes or heart disease, or a weakened immune system, form the 
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highest risk group. Co-infection with other pathogens seems to occur frequently, 
especially with opportunistic pathogenic microorganisms such as human 
metapneumovirus (hMPV), Chlamydia, etcetera. 

The incubation time for the virus is typically 2-7 days and the disease is 
transmitted by people sich with SARS coughing or sneezmg droplets in the air. 

As for yet it is not known if there is a cure for the disease. Several antiviral 
therapies have been applied, but with various results. 

Also, for being able to prevent spread of the disease, it is of great importance 
to be' able to recognise the disease in an early stage. Only then sufficient measures 
can be taken to isolate patients and initiate quarantaine precautions. At this moment 
there is not yet a diagnostic tool in place. 

Thus, there is great need in developing diagnostic tools and therapies for this 



The invention provides the nucleotide sequence of an isolated essentially 
mammalian positive-sense single stranded ENA virus belonging.to the 
Coronaviruses, which is the causative factor for SARS. From a phylogenetic analysis 
of the sequences of the virus (Fig. 1) it appears that the virus is an intermediate 
between the group formed by TGEV (transmissable gastroenetritis virus), PEDV 
(porcine epidemic diarrhea virus) and 229E (human coronavirus 229E) at one side, 
the group formed by BoCo (bovine coronavirus) and MHV (murine hepatitis virus) at 
an other side, and the AIBV (avian infectious bronchitis virus) oh yet another side . 
In general, bovine coronavirus seems to be the closest relative (at least for the viral 
replicase protein). 

Although phylogenetic analyses provide a convenient method of identifying a 
virus as a SARS virus several other possibly more straightforward albeit somewhat 
more coarse methods for identifying said virus or viral proteins or nucleic acids from 
said virus are herein also provided. As a rule of thumb a SARS virus can be identified 
by the percentages of homology of the virus, proteins or nucleic acids to be identified 
in comparison with viral proteins or nucleic acids identified herein by sequence. It is 
generally-known that virus species, especially RNA virus species, often constitute a 
quasi species wherein a cluster of said viruses displays heterogeneity among its 
members. Thus it is expected that each isolate may have a somewhat different 
percentage relationship with the sequences of the isolate as provided herein. 
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When one wishes to compare a virus isolate with the sequences as listed in 
figure 2, the invention provides an isolated essentially mammalian positive-sense 
single stranded RNA virus (SARS) belonging to the Coronaviruses and identifiable as 
phylogenetically corresponding thereto by determining a nucleic acid sequence of said 
5 virus and determining that said nucleic acid sequence has a percentage nucleic acid 
identity to the sequences as listed higher than the percentages identified herein for 
the nucleic acids as identified herein below in comparison with B0C0, AIPV and 
PEDV. Likewise, an isolated essentially mammalian positive-sense single stranded 
RNA virus (SARS) belonging to the Coronaviruses and identifiable as 
1 0 phylogenetically corresponding thereto by determining an amino acid sequence of 
said virus and determining that said amino acid sequence has a percentage amino 
acid homology to the sequences as listed which is essentially higher than the 
percentages provided herein in comparison with B0C0, AIPV and PEDV. 

15 With the provision of the sequence information of this SARS virus, the 

invention provides diagnostic means and methods, prophylactic means and methods 
and therapeutic means and methods to be employed in the diagnosis, prevention 
and/or treatment of disease, in particular of respiratory disease (atypical pneumonia), 
in particular of mammals, more in particular in humans. In virology, it is most 

2 0 advisory that diagnosis, prophylaxis and/or treatment of a specific viral infection is 

performed with reagents that are most specific for said specific virus causing said , 
infection. In this case this means that it is preferred that said diagnosis, prophylaxis 
and/or treatment of a SARS virus infection is performed with reagents that are most 
specific for SARS virus. This by no means however excludes the possibility that less 
25 ' specific, but sufficiently cross-reactive reagents are used instead, for example because 
they are more easily available and sufficiently address the task at hand. 
The invention for example provides a method for virologically diagnosing a SARS 
infection of an animal, in particular of a mammal, more in particular of a human 
being, comprising determining in a sample of said animal the presence of a viral 

3 0 isolate or component thereof by reacting said sample with a SARS specific nucleic 

acid or antibody according to the invention, and a method for serologically diagnosing 
a SARS infection of a mammal comprising determining in a sample of said mammal 
the presence of an antibody specifically directed against a SARS virus or component 
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agment thereof or an antigen according to the invention 
The invention also provides a diagnostic kit for diagnosing a SAES fatection 

=pss =— — i 

^example -P^ane^egronpsnchasaflnorophoreorentatio 
detect™ system used in the art (examples of suitable diagnoatie kit format • 
1 0 IF, ELISA, neutralization assav RT PPP * „ : PnSe 
unidentifi H • ""^ T ° aetem " e WhBther » n - ^ 

unrdentnaed vnua component or synthetic analogue thereof such as nucleic acid 

* suffices to analyse the nucleic acid or amino acid sequence o, said component for ' 

morepr. fe rablyatleast 2 5,morepreferablyatleast40nucle„t 1 desoraminoacid s 

feepecriveiy,, by sequence homology comparison with the provided 

sequences and with known uon-SAES viral sequences (BoCo is preferably used. • 

pwithsaid^oruon-S^v^seguences, the component of 
synthetic analogue can be identified. 

agent TT" ^ °* a novel etiological 

(herernalaocauedSARSvirusJbelongiugtotbeCoronaviridaefarnUy.andSAKS 
varus specific eompouents or synthetic analogues thereof. Coronaviruses we^t 
-atedfiom chickens in while the first human OT „navima wae propaga« in 

whrch urfect cattle, pigs, rodents, cate, dogs , birds and mam Coronals pa.til „ 

end). Thrs crowns appearance give the family its name. The envelope carries two 

ant,gen,andM,thememb r aheglye„p rot6m , whichi8involvedin ^ 
envelope formation. The genon.e is associated with a basic phosphoproten,, 
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designated N. The genome of coronaviruses, a single stranded positive-sense RNA 
strand, is typically 27-31 Kb long and contains a 5' methylated cap and a 3' poly-A 
tail, by which it can directly function as an mRNA in the infected cell. Initially the 5' 
ORF 1 (about 20 Kb) is translated to produce a viral polymerase, which then produces 
5 a full length negative sense strand. This is used as a template to produce mRNA as a 
'nested set' of transcripts, all with identical 5' non-translated leader sequence of 72 
nucleotides and coincident 3' polyadenylated ends. Bach mRNA thus produced is 
monocistronic, the genes at the 5' end being translated from the longest mRNA and 
so on. These unusual cytoplasmic structures are produced not by splicing, but by the 

1 0 polymerase during transcription. Between each of the genes there is a repeated 
intergenic sequence - AACUAAAG - which interacts with the transcriptase plus 
cellular factors to splice the leader sequence onto the start of each ORF. In some 
coronaviruses there are about 8 ORFs, coding for the proteins mentioned above, but 
also for a heamagglutenin esterase (HE), and several other non-structural proteins. 

1 5 Newly isolated viruses are phylogenetically corresponding to and thus taxonomically 
corresponding to SARS virus when comprising a gene order and/or amino acid 
sequence and/or nucleotide sequence sufficiently similar to our prototypic SARS 
virus. The highest amino acid sequence homology, between SARS virus and any of the 
known other viruses of the same family to date (BoCo or Mouse Hapatitis Virus) is 

2 0 for parts of the polymerase protein 18-61% (the % homology, and the virus to which 
the homology is depend on the region of the polymerase that is examined), as can be 
deduced when comparing the sequences given in figure 2 with sequences of other 
viruses, in particular of BoCo and Mouse Hapatitis Virus. Individual proteins or 
whole virus isolates with, respectively, higher homology than these mentioned 

2 5 maximum values are considered phylogenetically corresponding and thus 

taxonomicaUy corresponding to SARS virus, and generally will be encoded by a 
nucleic acid sequence structurally corresponding with a sequence as shown in figure 
2. Herewith the invention provides a virus phylogenetically corresponding to the 
isolated virus of which the sequences are depicted in figure 2. 

30 It should be noted that, similar to other viruses, a certain degree of variation can be 
expected to be found between SARS-viruses isolated from different sources. 
Also, the viral sequence of the SARS virus or an an isolated SARS virus gene as 
provided herein for example shows less than 95%, preferably less than 90%, more 
preferably less than 80%, more preferably less than 70% and most preferably less 
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than 650/c nucleotide sequence homology or less than 95%, preferably less than 90% 
more preferably less than 80%, more preferably less than 70% and most preferably ' 
less than 65% amino acid sequence homology with the respective nucleotide or amino 
acxd sequence of the bovine coronavirus or the murine hepatitis virus as for example 
5 can be found in Genbank (for example in accession number NC_002306 (BoCo) or 
NC_002645 (MHV)). 

Sequence divergence of SAES strains around the world may be somewhat higher, in 
analogy with other coronaviruses. 

A fair number of virus isolates have been isolated during the priority year of the 

10 present application, and it has been found that these viruses share the homology 

indicated above. The sequences of these viruses can be found in GenBank accession 

no. AY274119 (see fig. 10) or AY278741 or AY338175 or AY338174 or AY322199 or 

AY 322198 or AY322197 or AH013000 or AY322208 or AY322207 AY 322206 or 

AY322205 or AH012999 and and/or sequences depicted in 

15 http://ww.ncbi.nlm.n^^ 

keep=l& src hmode=l&unlock. Herewith the invention encompasses a virus 

phylogenetically corresponding to the isolated virus of which the sequences are 

depicted in figure 2 and/or for example the GenBank accession no. AY274119 or 

AY278741 or AY338175 or AY338174 or AY322199 or AY 322198 or AY322197 or 

2 0 AH013000orAY322208orAY322207AY322206orAY322205orAH012999and 
and/or sequences depicted in 

http://www.nchi.mtt.mh.^ 

.ke ep=l&srchmode=1 fennWlr 

25 The term "nucleotide sequence homology" as used herein denotes the presence of 
homology between two (polynucleotides. Polynucleotides have "homologous- 
sequences if the sequence of nucleotides in the two sequences is the same when 
aligned for maximum correspondence. Sequence comparison between two or more 
polynucleotides is generally performed by comparing portions of the two sequences 
3 0 over a comparison window to identify and compare local regions of sequence 

similarity. The comparison window is generally from about 20 to 200 contiguous 
nucleotides. The "percentage of sequence homology" for polynucleotides, such as 50 
60, 70, 80, 90, 95, 98, 99 or 100 percent sequence homology may be determined by ' 
comparing two optimally aligned sequences over a comparison window, wherein the 
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portion of the polynucleotide sequence in the comparison window may include 
additions or deletions (i.e. gaps) as compared to the reference sequence (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by: (a) determining the number of positions at which the 
5 identical nucleic acid base occurs in both sequences to yield the number of matched 
positions; (b) dividing the number of matched positions by the total number of 
positions in the window of comparison; and (c) multiplying the result by 100 to yield 
the percentage of sequence homology. Optimal alignment of sequences for comparison 
may be conducted by computerized implementations of known algorithms, or by 

10 inspection. Readily available sequence comparison and multiple sequence alignment 
algorithms are, respectively, the Basic Local Alignment Search Tool (BLAST) 
(Altschul, S.F. et al. 1990. J. Mol. Biol. 215:403; Altschul, S.F. et al. 1997. Nucleic 
Acid Res. 25:3389-3402) and ClustalW programs both available on the internet. Other 
suitable programs include GAP, BESTFIT and FASTA in the Wisconsin Genetics 

15 Software Package (Genetics Computer Group (GCG), Madison, WI, USA). 
As used herein, "substantially complementary" means that two nucleic acid 
sequences have at least about 65%, preferably about 70%, more preferably about 80%, 
even more preferably 90%, and most preferably about 98%, sequence 
complementarity to each other. This means that the primers and probes must exhibit 

2 0 sufficient complementarity to their template and target nucleic acid, respectively, to 
hybridise under stringent conditions. Therefore, the primer sequences as disclosed in 
this specification need not reflect the exact sequence of the binding region on the 
template and degenerate primers can be used. A substantially complementary primer 
sequence is one that has sufficient sequence complementarity to the amplification 

2 5 template to result in primer binding and second-strand synthesis. 

The term "hybrid" refers to a double-stranded nucleic acid molecule, or duplex, 
formed by hydrogen bonding between complementary nucleotides. The terms 
"hybridise" or "anneal" refer to the process by which single strands of nucleic acid 
sequences form double -helical segments through hydrogen bonding between 

3 0 complementary nucleotides. 

The term "ohgonucleotide" refers to a short sequence of nucleotide monomers (usually 
6 to 100 nucleotides^ joined by phosphorous linkages (e.g., phosphodiester, alkyl and 
aryl-phosphate, phosphorothioate), or non-phosphorous linkages (e.g., peptide, 
sulfamate and others). An oligonucleotide may contain modified nucleotides having 
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modified bases (e.g., 5-methyl cytosine) and modified sugar groups (e.g., 2'-0-methyl 
ribosyl, 2'-0--methoxyethyl ribosyl, 2'-fluoro ribosyl, 2'-amino ribosyl, and the like). 
Oligonucleotides may be naturally-occurring or synthetic molecules of double- and 
single-stranded DNA and double- and single-stranded RNA with circular, branched 
or linear shapes and optionaUy including domains capable of forming stable 
secondary structures (e.g., stem-and-loop and loop-stem-loop structures). 
The term "primer" as used herein refers to an oligonucleotide which is capable of 
annealing to the amplification target allowing a DNA polymerase to attach thereby 
serving as a point of initiation of DNA synthesis when placed under conditions in 
which synthesis of primer extension product which is complementary to a nucleic acid 
strand is induced, i.e., in the presence of nucleotides, and an agent for polymerization 
such as DNA polymerase and at a suitable temperature and pH. The (amplification) 
primer is preferably single stranded for maximum efficiency in amplification. 
Preferably, the primer is an oligodeoxy ribonucleotide. The primer must be 
sufficiently long to prime the synthesis of extension products in the presence of the 
agent for polymerization. The exact lengths of the primers will depend on many 
factors, including temperature and source of primer. A "pair of bi-directional primers" 
as used herein refers to one forward and one reverse primer as commonly used in the 
art of DNA amplification such as in PGR amplification. 

The term "probe" refers to a single-stranded oligonucleotide sequence that will 
recognize and form a hydrogen-bonded duplex with a complementary sequence in a 
target nucleic acid sequence analyte or its cDNA derivative. 

The terms "stringency" or "stringent hybridization conditions" refer to hybridization 
conditions that affect the stability of hybrids, e.g., temperature, salt concentration, 
pH, formamide concentration and the like. These conditions are empirically optimised 
to maximize specific binding and minimize non-specific binding of primer or probe to 
its target nucleic acid sequence. The terms as used include reference to conditions 
under which a probe or primer will hybridise to its target sequence, to a detectably 
greater degree than other. sequences (e.g. at least 2-fold over background). Stringent 
conditions are sequence dependent and will be different in different circumstances. 
Longer sequences hybridise specifically at higher temperatures. Generally, stringent 
conditions are selected to be about 5°C lower than the thermal melting point (Tm) for 
the specific sequence at a defined ionic strength and pH. The Tm is the temperature 
(under defined ionic strength and pH) at which 50% of a complementary target 
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sequence hybridises to a perfectly matched probe or primer. Typically, stringent 
conditions will be those in which the salt concentration is less than about 1.0 M Na+ 
ion, typically about 0.01 to 1,0 M Na+ ion concentration (or other salts) at pH 7.0 to 
8.3 and the temperature is at least about 30°C for short probes or primers (e.g. 10 to 
5 50 nucleotides) and at least about 60°C for long probes or primers (e.g. greater than 
50 nucleotides). Stringent conditions may also be achieved with the addition of 
destabilizing agents such as formamide. Exemplary low stringent conditions or 
"conditions of reduced stringency" include hybridization with a buffer solution of 30% 
formamide, 1 M NaCl, 1% SDS at 37°C and a wash in 2x SSC at 40°C. Exemplary 
10 high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% 
SDS at 37°C, and a wash in O.lx SSC at 60°C. Hybridization procedures are well 
known in the art and are described in e.g. Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons Inc., 1994. 

The term "antibody" includes reference to antigen binding forms of antibodies (e. g., 
15 Fab, F (ab) 2). The term "antibody" frequently refers to a polypeptide substantially 
encoded by an immunoglobulin gene or immunoglobulin genes, or fragments thereof 
which, specifically bind and recognize an analyte (antigen). However, while various 
antibody fragments can be defined in terms of the digestion of an intact antibody, one 
of skill will appreciate that such fragments may be. synthesized de novo either 
2 0 chemically or by utilizing recombinant DNA methodology. Thus, the term antibody, 
as used herein, also includes antibody fragments such as single chain Fv, chimeric 
antibodies (i. e., comprising constant and variable regions from different species), 
humanized antibodies (i. e., comprising a complementarity determining region (CDR) 
from a non-human source) and heteroconjugate antibodies (e. g., bispecific 
25 antibodies). 

"Interferon" is a term generically comprehending a group of vertebrate glycoproteins 
and proteins which are known to have various biological activities, such as antiviral, 
antiproliferative, and immunomodulatory activity at least in the species of animal 
from which such substances are derived, interferon refers to a class of small protein 
30 and glycoprotein cytokines (15-28 kD) produced by T cells, fibroblasts, and other cells 
in response to viral infection and other biological and synthetic stimuli. Interferons 
bind to specific receptors on cell membranes; their effects include inducing enzymes, 
suppressing cell proliferation, inhibiting viral proliferation, enhancing the phagocytic 
activity of macrophages, and augmenting the cytotoxic activity of T lymphocytes. 
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Interferons are divided into five major classes (alpha, beta, gamma, tau, and omega) 
and several subclasses (indicated by Arabic numerals and letters) on the basis of 
physicochemical properties, cells of origin, mode of induction, and antibody reactions. 

5 In short, the invention provides an isolated essentially mammalian positive- 

sense single stranded UNA virus (SARS) belonging to the Coronaviruses and 
identifiable as phylogeneticaUy corresponding thereto by determining a nucleic acid 
sequence of a suitable fragment of the genome of said virus and testing it in 
phylogenetic tree analyses wherein maximum likelihood trees are generated using 

1 0 100 bootstraps and 3 jumbles and finding it to be more closely phylogeneticaUy 

corresponding to a virus isolate having the sequences as depicted in figure 2 than it is 
corresponding to a virus isolate of BoCo (bovine coronavirus, e.g. acc. no. NC_002306 
in Genbank), MHV (murine hepatitis virus, e.g. acc. no. NC.002645), AIBV (avian 
infectious bronchitis virus, e.g. ace. no. NC_001451), PEDV (porcine epidemic 

15 diarrhea virus), TGEV (transmissible gastroenteritis virus, e.g. acc. no. NC_003436) 
or 229E (human coronavirus 229E, e.g. acc. no. NC_003045). All the viral sequences 
with the GenBank accession numbers mentioned above are believed to be 
phylogenticaUy cooresponding viruses to the virus of which the sequences are 
depicted in fig. 2. 

2 0 Suitable nucleic acid genome fragments each useful for such phylogenetic tree 

analyses are for example any of the RAP-PCR fragments EMC-1 to -14 and FJDG-l 
as disclosed in figure 2, leading to the phylogenetic tree analysis as. disclosed in figure 
1. 

A suitable open reading frame (OEF) comprises the QRF encoding the viral 

2 5 polymerase (ORF la). When an overall amino acid identity of at least 60%, preferably 

of at least 70%, more preferably of at least 80%, more preferably of at least 90%, most 
preferably of at least 95% of the analysed polymerase with the polymerase having a 
sequence comprising the amino acid fragments EMC-1, EMC-2, EMC-3, EMC-4.EMC- 
; 5, EMC-13 and/or EMC- 14 of figure 2 is found, the analysed virus isolate comprises a 

3 0 SARS virus isolate according to the invention. 

Another suitable open reading frame (ORF) useful in phylogenetic analyses 
comprises the ORF encoding the N protein. When an overall amino acid identity of at 
least 60%, more preferably of at least70%, more preferably of at least 80%, more 
preferably of at least 90%, most preferably of at least 95% of the analysed N-protein 
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with the N-protein encoded by a sequence comprising the sequence EMC-8 of figure 2 
is found, the analysed virus isolate comprises a SAKS isolate according to the 
invention. 

Another suitable open reading frame (ORF) useful in phylogenetic analyses 
5 comprises the ORF encoding the spite protein S. When an overall amino acid identity 
of at least 60%, more preferably of at least 70%, more preferably of at least 80%, more 
preferably of at least 90%, most preferably of at least 95%of the analysed S-protein 
encoded by a sequence comprising the sequence of translation 2 of EMC7 and 
translation 1 of the RDG 1 sequence of the S-protein as depicted in figure 2 is found, 
10 the analysed virus isolate comprises a SABS virus isolate according to the invention. 
The S ORF of the SARS virus seems to be located adjacent to the ORF lab (coding for 
the viral polymerase), which would discriminate SARS viruses from the bovine 
coronavirus and the murine hepatitis virus, which have a so-called 2a gene and an 
HE-gene between the S protein and the viral polymerase. 

15 

The invention provides among others an isolated or recombinant nucleic acid 
or virus-specific functional fragment thereof obtainable from a virus according to the 
invention. The isolated or recombinant nucleic acids comprises the sequences as given 
in figure 2 or sequences of homologues which are able to hybridise with those under 

2 0 stringent conditions. In particular, the invention provides primers and/or probes 

suitable for identifying a SARS virus nucleic acid. 

Furthermore, the invention provides a vector comprising a nucleic acid according to 
the invention. To begin with, vectors such as plasmid vectors containing (parts of) the 
genome of SARS virus, virus vectors containing (parts of) the genome of SARS (for 
25 example, but not limited thereto, vaccinia virus, retroviruses, baculovirus), or SARS 
virus containing (parts of) the genome of other viruse or other pathogens are 
provided. 

Also, the invention provides a host cell comprising a nucleic acid or a vector according 
to the invention. Plasmid or viral vectors containing the polymerase components of 

3 0 SARS virus are generated in prokaryotic cells for the expression 6f the components in 

relevant cell types (bacteria, insect cells, eukaryotic cells). Plasmid or viral vectors 
containing full-length or partial copies of the SARS virus genome will be generated in 
prokaryotic cells for the expression of viral nucleic acids in-vitro or in-vivo. The latter 
vectors may contain other viral sequences for the generation of chimeric viruses or 
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chimeric virus proteins, may lack parts of the viral genome for the generation of 
replication defective virus, and may contain mutations, deletions or insertions for the 
generation of attenuated viruses. 

Infectious copies of SARS virus (being wild type/attenuated, repHcation-defective or 
chimeric) can be produced upon co-expression of the polymerase components 
according to the state-of-the-art technologies described above. 

In addition, eukaryotic cells, transiently or stably expressing one or more full-length 
or partial SAES virus proteins can be used. Such cells can be made by transfection 
(proteins or nucleic acid vectors), infection (viral vectors) or transduction (viral 
vectors) and may be useful for complementation of mentioned wild type, attenuated, 
replication-defective or chimeric viruses. 

A chimeric virus may be of particular use for the generation of recombinant 
vaccines protecting against two or more viruses. For example, it can be envisaged 
that a SARS virus vector expressing one or more proteins of a human 
metapneumovirus or a human metapneumovirus vector expressing one or more 
proteins of SAES virus will protect individuals vaccinated with such vector against 
both virus infections. Such a specific chimeric virus is particularly useful in the 
invention because it is suspected that co-infection of, for instance, human 
metapneumovirus frequently occurs in SARS virus infected patients. Attenuated and 
replication-defective viruses may be of use for vaccination purposes with live vaccines 
as has been suggested for other viruses. Recently, Subbarao, K et al., J. Virol. 78(7), 
3572-3577, 2004) demonstrated that mice are protected as a result from a previous 
immunisation with whole viruses. 

In a preferred embodiment, the invention provides a proteinaceous molecule or 
coronavirus-specific viral protein or functional fragment thereof encoded by a nucleic 
acid according to the invention. Useful proteinaceous molecules are for example 
derived from any of the genes or genomic fragments derivable from a virus according 
to the invention. Such molecules, or antigenic fragments thereof, as provided herein, 
are for example useful in diagnostic methods or kits and in pharmaceutical 
compositions such as sub-unit vaccines and inhibitory peptides. Particularly useful 
are the viral polymerase protein, the spike protein, the nucleocapsid or antigenic 
fragments thereof for inclusion as antigen or subunit immunogen, but inactivated 
whole virus can also be used. Pa'rticulary useful are also those proteinaceous 
substances that are encoded by recombinant nucleic acid fragments that are 
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identified for phylogenetic analyses, of course preferred are those that are within the 
preferred bounds and metes of ORFs useful in phylogenetic analyses, in particular for 
eliciting SARS virus specific antibodies, whether in vivo (e.g. for protective puposes or 
for providing diagnostic antibodies) or in vitro (e.g. by phage display technology or 
5 another technique useful for generating synthetic antibodies). 

Also provided herein are antibodies, be it natural polyclonal or monoclonal, or 
synthetic (e.g. (phage) library-derived binding molecules) antibodies that specifically 
react with an antigen comprising a proteinaceous molecule or SARS virus-specific 
functional fragment thereof according to the invention. A person skilled in the art 

1 0 will be able to develop (monoclonal) antibodies using isolated virus material and/or 
recombinantly expressed viral proteins. Sui et al. (Proc. Natl. Acad. Sci. 101(8), 2536- 
2541, 2004) have transiently expressed fragments of the spike protein and found 
several antibodies through phage display methods. One of these antibodies was 
shown to be directed to the N-terminal 261-672 amino acids of the S (spike) protein 

1 5 (which would be corresponding to the sequence of translation 2 of EMC7 and 

translation 1 of the RDG 1 sequence of the S-protein as depicted in figure 2 ) and this 
antibody was also demonstrated to have neutralising properties, indicating that it 
may be a candidate for succesfull vaccines. Also Subbarao et al. (supra) showed that 
serum from mice that had been infected with SARS virus was able to block infectivity 

2 0 of 100 TCIDso of SARS virus in Vero cell monolayers, due to the presence of 
neutralising antibodies. 

Such antibodies are also useful in a method for identifying a viral isolate as a 
SARS virus comprising reacting said viral isolate or a component thereof with an 
antibody as provided herein. This can for example be achieved by using purified or 

2 5 non-purified SARS virus or parts thereof (proteins, peptides) using ELISA, RIA, 

FACS or similar formats of antigen detection assays (Current Protocols in 
Immunology). Alternatively, infected cells or cell cultures may be used to identify 
viral antigens using classical immunofluorescence or immunohistochemical 
techniques. Specifically useful in this respect are antibodies raised against SARS 

3 0 virus proteins which are encoded by a nucleotide sequence comprising one or more of 

the fragments disclosed in figure 2. 
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Other methods for identifying a viral isolate as a SARS virus comprise 
reacting said viral isolate or a component thereof with a virus specific nucleic acid 
according to the invention. 

In this way the invention provides a viral isolate identifiable with a method according 
5 to the invention as a mammalian virus i taxonomically corresponding to a positive- 
sense single stranded RNA virus identifiable as likely belonging to the SARS virus 
genus within the family of Coronaviruses. 

The method, is useful in a method for virologicaUy diagnosing a SARS virus infection 
of a mammal, said method for example comprising determining in a sample of said 

1 0 mammal the presence of a viral isolate or component thereof by reacting said sample 
with a nucleic acid or an antibody according to the invention. 
Methods of the invention can in principle be performed by using any nucleic acid 
amplification method, such as the Polymerase Chain Reaction (PCR; MuUis 1987, 
U.S. Pat. No. 4,683,195, 4,683,202, en 4,800,159) or by using amplification reactions 

1 5 such as Ligase Chain Reaction (LCR; Barany 1991, Proc. Natl. Acad. Sci. USA 
88:189-193; BP Appl. No., 320,308), Self-Sustained Sequence Replication (3SR; 
Guatelli et al, 1990, Proc. Natl. Acad. Sci. USA 87: 1874-1878), Strand Displacement 
Amplification (SDA; U.S. Pat. Nos. 5,270, 184, en 5,455, 166), Transcriptional 
Amplification System (TAS; Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173-1177), Q- 

2 0 Beta Replicase (Lizardi et al., 1988, Bio/Technology 6:1197), Rolling Circle 

Amplification (RCA; U.S. Pat. No. 5,871,921), Nucleic Acid Sequence Based 
Amplification (NASBA), Cleavase Fragment Length Polymorphism (U.S. Pat. No. 
5,719,028), Isothermal and Chimeric Primer-initiated Amplification of Nucleic Acid 
(ICAN), Ramification-extension Amplification Method (RAM; U.S. Pat. Nos. 5,719,028 
25- and 5, 942, 3 91) or other suitable methods for amplification of nucleic acids. 

In order to amplify a nucleic acid with a small number of mismatches to one or more 
of the amplification primers, an amplification reaction may be performed under 
conditions of reduced stringency (e.g. a PCR amplification using an annealing 
temperature of 38°C, or the presence of 3.5 mM MgCl2). The person skilled in- the art 

3 0 will be able to select conditions of suitable stringency. 

The primers herein are selected to be "substantially" complementary (i.e. at least 
65%, more preferably at least 80% perfectly complementary) to their target regions 
present on the different strands of each specific sequence to be amplified. It is 
possible to use primer sequences containing e.g. inositol residues or ambiguous bases 
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or even primers that contain one or more mismatches when compared to the target 
sequence. In general, sequences that exhibit at least 65%, more preferably at least 
80% homology with the target DNA or RNA oligonucleotide sequences, are considered 
suitable for use in a method of the present invention. Sequence mismatches are also 
5 not critical when using low stringency hybridization conditions. 

The detection of the amplification products can in principle be accomplished by any 
suitable method known in the art. The detection fragments may be directly stained or 
labelled with radioactive labels, antibodies, luminescent dyes, fluorescent dyes, or 
enzyme reagents. Direct DNA stains include for example intercalating dyes such as 

1 0 acridine orange, ethidium bromide, ethidium monoazide or Hoechst dyes. 

Alternatively, the DNA or RNA fragments may be detected by incorporation of 
labelled dNTP bases into the synthesized fragments. Detection labels which may be 
associated with nucleotide bases include e.g. fluorescein, cyanine dye or BrdUrd. 
When using a probe-based detection system, a suitable detection procedure for use in 

15 the present invention may for example comprise an enzyme immunoassay (EIA) 
format (Jacobs et'al., 1997, J. Clin. Microbiol. 35, 791-795). For performing a 
detection by manner of the EIA procedure, either the forward or the reverse primer 
used in the amplification reaction may comprise a capturing group, such as a biotin 
group for immobilization of target DNA PGR amplicons on e.g. a streptavidin coated 
. 2 0 microtiter plate wells for subsequent EIA detection of target DNA -amplicons (see 
below). The skilled person will understand that other groups for immobilization of 
target DNA PCR amplicons in an EIA format may be employed. 
Probes useful for the detection of the target DNA as disclosed herein preferably bind 
only to at least a part of the DNA sequence region as amplified by the DNA 

2 5 amplification procedure. Those of skill in the art can prepare suitable probes for 

detection based on the nucleotide sequence of the target DNA without undue 
experimentation as set out herein. Also the complementary nucleotide sequences, 
whether DNA or RNA or chemically synthesized analogs, of the target DNA may 
suitably be used as type-specific detection probes in a method of the invention, 

3 0 provided that such a complementary strand is amplified in the amplification reaction 

employed. 

Suitable detection procedures for use herein may for example comprise 
immobilization of the amplicons and probing the DNA sequences thereof by e.g. 
southern blotting. Other formats may comprise an EIA format as described above. To 
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facilitate the detection of binding, the specific amplicon detection probes may 
comprise a label moiety such as a fluorophore, a chromophore, an enzyme or a radio- 
label, so as to facilitate monitoring of binding of the probes to the reaction product of 
the amplification reaction. Such labels are well-known to those skilled in the art and 
include, for example, fluorescein isothiocyanate (FITC), p-galactosidase, horseradish 
peroxidase, streptavidin, biotin, digoxigenin, 35S or 1251. Other examples will be 
apparent to those skilled in the art. 

Detection may also be performed by a so called reverse line blot (RLB) assay, such as 
for iristance described by Van den Brule et al. (2002, J. Clin. Microbiol. 40, 779-787). 
For this purpose RLB probes are preferably synthesized with a 5" amino group for 
subsequent immobilization on e.g. carboxyl-coated nylon membranes. The advantage 
of an RLB format is the ease of the system and its speed, thus allowing for high 
throughput sample processing. 

The use of nucleic acid probes for the detection of RNA or DNA fragments is well 
known in the art. Mostly these procedure comprise the hybridization of the target 
nucleic acid with the probe followed by post-hybridization washings. Specificity is 
typically the function of post-hybridization washes, the critical factors being the ionic 
strength and temperature of the final wash solution. For nucleic acid hybrids, the Tm 
can be approximated from the equation of Meinkoth and Wahl, Anal. Biochem., 138: 
267-284 (1984): Tm = 81.5 °C + 16.6 (log M) + 0.41 (% GQ-0.61 (% form)-500/L; where 
M is the molarity of monovalent cations, % GC is the percentage of guanosine and 
cytosine nucleotides in the nucleic acid, % form is the percentage of formamide in the 
hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the 
temperature (under defined ionic strength and pH) at which 50% of a complementary 
target sequence hybridizes to a perfectly matched probe. Tm is reduced by about 1 °C 
for each 1 % of mismatching; thus, the hybridization and/or wash conditions can be 
adjusted to hybridize to sequences of the desired identity. For example, resequences 
with > 90% identity are sought, the Tm can be decreased 10°C. Generally, stringent 
conditions are selected to be about 5 °C lower than the thermal melting point (Tm) for 
the specific sequence and its complement at a defined ionic strength and pH. 
However, severely stringent conditions can utilize a hybridization and/or wash at 
1,2,3, or 4 °C lower than the thermal melting point (Tm); moderately stringent 
conditions can utilize a hybridization and/or wash at 6,7,8,9, or 10 °C lower than the 
thermal melting point (Tm); low stringency conditions can utilize a hybridization 
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and/or wash at 11,12,13,14,15, or 20 °C lower than the thermal melting point (Tm). 
Using the equation, hybridization and wash compositions, and desired Tm, those of 
ordinary skill will understand that variations in the stringency of hybridization 
and/or wash solutions are inherently described. If the desired degree of mismatching 
.5 results in a Tm of less than 45 °C (aqueous solution) or 32 °C (formamide solution) it 
is preferred to increase the SSC concentration so that a higher temperature can be 
used. An extensive guide to the hybridization of nucleic acids is found in Tijssen, 
Laboratory Techniques in Biochemistm and Molecular Biology — Hybridization with 
Nucleic Acid Probes, Part I, Chapter 2" Overview of principles of hybridization and 

1 0 the strategy of nucleic acid probe assays", Elsevier. New York (1993); and Current 
Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing 
and Wiley-Interscience, New York (1995). 

In another aspect, the invention provides oligonucleotide probes for the 
generic detection of target RNA or DNA. The detection probes herein are selected to 

15 be "substantially" complementary to one of the strands of the double stranded nucleic 
acids generated by an amplification reaction of the invention. Preferably the probes 
are substantially complementary to the immobilizable, e.g. biotin labelled, antisense 
strands of the amplicons generated from the target RNA or DNA. 

It is allowable for detection probes of the present invention to contain one or 

2 0 more mismatches to their target sequence. In general, sequences that exhibit at least 

65%, more preferably at least 80% homology with the target oligonucleotide 
sequences are considered suitable for use in a method of the present invention. 
Antibodies, both monoclonal and polyclonal, can also be used for detection purpose in 
the present invention, for example, in immunoassays in which they can be utilized in 
.25 liquid phase or bound to a solid phase carrier. In addition, the monoclonal antibodies 
in these immunoassays can be detectably labeled in various ways. A variety of 
immunoassay formats may be used to select antibodies specifically reactive with a 
particular protein (or other analyte). For example, solid-phase ELISA immunoassays 
are routinely used to select moraxrlonalTmtihodies specifically immunoreactive with a 

3 0 protein. See Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor 

Publications, New York (1988), for a description of immunoassay formats and 
conditions that can be used to determine selective binding. Examples of types of 
immunoassays that can utilize antibodies of the invention are competitive and non- 
competitive immunoassays in either a direct or indirect format. Examples of such 
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immunoassays are the radioimmunoassay (RIA) and the sandwich (immunometric) 
assay. Detection of the antigens using the antibodies of the invention can be done 
utilizing immunoassays that are run in either the forward, reverse, or simultaneous 
modes, including immunohistochemical assays on physiological samples. Those of 
skill in the art will know, or can readily discern, other immunoassay formats without 
undue experimentation. 

Antibodies can be bound to many different carriers and used to detect the 
presence of the target molecules. Examples of well-known carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, agaroses and magnetite. The nature of the 
carrier can be either soluble or insoluble for purposes of the invention. Those skilled 
in the art will know of other suitable carriers for binding monoclonal antibodies, or 
will be able to ascertain such using routine experimentation. 

The invention also provides a method for serologically diagnosing a SARS 
virus infection of a mammal comprising determining in a sample of said mammal the 
presence of an antibody specifically directed against a SARS virus or component 
thereof by reacting said sample with a proteinaceous molecule or fragment thereof or 
an antigen according to the invention 

Methods and means provided herein are particularly useful in a diagnostic kit 
for diagnosing a SARS virus infection, be it by virological or serological diagnosis. 
Such kits or assays may for example comprise a virus, a nucleic acid, a proteinaceous 
molecule or fragment thereof, an antigen and/or an antibody according to the 
invention. 

Use of a virus, a nucleic acid, a proteinaceous molecule or fragment thereof, an 
antigen and/or an antibody according to the invention is also provided for the 
production of a pharmaceutical composition, for example for the treatment or 
prevention of SARS virus infections and/or for the treatment or prevention of atypical 
pneumonia, in particular in humans. Preferably a peptide comprising part of the 
amino acid sequence of the spike protein as depicted in translation 2 with the 
sequence EMC7 and translation 1 of the RDG seq of figure 2, is used for the 
preparation of a therapeutic or prophylactic peptide. Also preferably, a protein 
comprising the amino acid sequence of the spike protein as depicted in translation 2 
with the sequence EMC7 translation 1 of the RDG seqof figure 2, is used for the 
preparation of a sub-unit vaccine. Furthermore, the nucleocapsid of Cornoviruses, as 
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depicted in the translation of EMC8, in figure 2, is known to be particularly useful for 
eliciting cell-mediated immunity against Coronaviruses and can be used for the 
preparation of a sub-unit vaccine. 

Attenuation of the virus can be achieved by established methods developed for 
5 this purpose, including but not Hmited to the use of related viruses of other species, 
serial passages through laboratory animals or/and tissue/cell cultures, serial 
passages through cell cultures at temparutes below 37C (cold-adaption), site directed 
mutagenesis of molecular clones and exchange of genes or gene fragments between 
related viruses. 

10 As is shown by Sui et al. (supra) humanised neutralising antibodies have been 

prepared which have shown to be reactive with the N- terminal 261-672 amino acids 
of the spike protein of the SAKS virus. 

A pharmaceutical composition comprising a virus, a nucleic acid, a 
proteinaceous molecule or fragment thereof, an antigen and/or an antibody according 

15 to the invention can for example be used in a method for the treatment or prevention 
of a SAKS virus infection and/or a' respiratory illness comprising providing an 
individual with a pharmaceutical composition according to the invention. This is most 
useful when said individual comprises a human. Antibodies against SAKS virus 
proteins, especially against the spike protein of SAKS virus, preferably against the 

2 0 amino acid sequence as depicted in translation 2 of EMC7 and translation 1 of the 
RDG seq in figure 2, are also useful for prophylactic or therapeutic purposes, as 
passive vaccines. It is known from other coronaviruses that the spike protein is a very 
strong antigen and that antibodies against spike protein can be used in prophylactic 
and therapeutic vaccination. 

2 5 The invention also provides method to obtain an antiviral agent useful in the 

treatment of atypical pneumonia comprising estabhshing a cell culture or 
experimental animal comprising a virus according to the invention, treating said 
culture or animal with an candidate antiviral agent, and determining the effect of 
said agent on said virus or its infection of said culture or animal. An example of such 

3 0 an antiviral agent comprises a SAKS virus-neutralising antibody, or functional 

component thereof, as provided herein, but antiviral agents of other nature are 
obtained as well. The invention also provides use of an antiviral agent according to 
the invention for the preparation of a pharmaceutical composition, in particular for 
the preparation of a pharmaceutical composition for the treatment of atypical 
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pneumonia, especifically when caused by a SARS virus infection, and provides a 
pharmaceutical composition comprising an antiviral agent according to the invention, 
useful in a method for the treatment or prevention of a SARS virus infection or 
atypical pneumonia, said method comprising providing an individual with such a 
pharmaceutical composition. 

Specifically the invention provides a pharmaceutical composition comprising 
interferon, especially pegylated interferon. 

In general all interferon forms would be useful in the present invention, since 
it is known that all the interferon forms have at least some activity in alleviating 
(symptoms of) viral infection. However, it is to be understood that preferentially the 
interferon is used which is derived from the host which is infected with, or which 
runs the risk of being infected with the virus. Further, most preferred is the use of 
interferon-alpha, and especially - for coronavirus infections that affect humans, like 
SARS - human interferon-alpha. Alpha interferon is a natural protein produced by 
the human body in response to infection. It is also known as interferon alpha-2b. The 
type I interferon alpha family consists of small proteins that have clinically 
important anti-infective and anti-tumor activity. It is understood that alpha 
interferon may be administered alone or in combination with beta interferon or 
gamma interferon. 

Genetic engineering techniques have allowed several companies to mass- 
produce alpha interferon, which is known as recombinant human alpha interferon, or 
by abbreviations such as rhIFN or rIFN-alpha. This is marketed under tradenames 
such as Viraferon (made by Schering-Plough), Roferon-A (by Roche) and Wellferon (by 
Glaxo SmithKhne). Interferon-alpha N3, or Alferon N, is another form of interferon 
alpha, derived from human leukocytes and containing multiple species of interferon- 
alpha. 

A drawback to the use of interferon-alpha as discussed previously, is the short 
serum half-life and rapid clearance of the interferon alpha protein. However, it has 
been shown that the attachment of molecules of polyethylene glycol (PEG) to the 
interferon, creates a barrier that shields the interferon alfa-2a molecule from being 
rapidly degraded by proteases in the body and maintains its ability to consistently 
suppress the target virus over a longer dosage period. 

As already discussed above pegyl'ation of proteins such as inteferon is used to 
prevent rapid removal from thd bloodstream and eventually rapid breakdown of the 
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drug. A prolongation of the serum, half-life of more than a factor two has been 
demonstrated (Shannon A. Marshall, Drug Discovery Today Volume 8, Issue 5 , 
March 2003, Pages 212-221). Pegylated IFN alfa-2b has a prolonged serum half-life 
(40 hours) relative to standard IFN alfa-2b (7-9 hours). The greater size of pegylated 
5 IFN alfa-2a acts to reduce glomerular filtration, markedly prolonging its serum half- 
life (72-96 hours) compared with standard IFN alfa-2a (6-9 hours) (Bruce A. Luxon 
MD Clinical Therapeutics, Volume 24, Issue 9, September 2002, Pages 1363-1383). 

Pegylation of proteins is a standard technique available to a person skilled in 
the art, and standard pegylated interferons are available commercially from Roche 
10 (PEGASYS® (interferon alfa 2a) and Schering-Plough ( PEG-Intron A) or in 

development like PEG-Alfacon, the PEGylated version of Infergen(R) (Interferon 
alfacon-1) a bio-engineered type I interferon alpha. 

Schering-Plough has developed a semi-synthetic form of Intron® A by attaching a 12- 
kDa mono-methoxy polyethylene glycol to the protein (PEG Intron) which fulfils the 

15 requirements of a long-acting interferon alpha protein while providing significant 

clinical benefits. Pegylation decreases the specific activity of the interferon alpha-2b 
protein, whilst the potency of PEG Intron, independent of protein concentration; is 
comparable to the Intron® A standard at both the molecular and cellular level. PEG 
Intron has enhanced pharmacokinetic profile in both animal and human studies [see 

2 0 Yu-Sen Wang et al., 2002: Advanced Drug Delivery Reviews, Volume 54, Issue 4 , 17, 
Pages 547-570]. In PEGASYS, a 40 kilodalton branched, mobile PEG is covalently. 
bound to the interferon alfa-2a molecule and provides a selectively protective barrier 
without significantly reducing binding site receptivity. 

It is understood that pharmacokinetic behaviour of a pegylated molecule 

2 5 depends on the size of the PEG and the structure of the link between the PEG moiety 

and the protein (Shannon A. Marshall, Drug Discovery Today Volume 8, Issue 5 , 
March 2003, Pages 212-221). It is known that interferons with smaller PEGs are 
degraded quickly, requiring more frequent dosing. Thus interferons with larger PEGs 
are preferred. 

3 0 Thus the present invention, encompasses all types of pegylated interferons or 

future interferons with yet undisclosed molecule attachments which provide a 
selectively protective barrier, to shield the interferon from being degraded, without 
significantly reducing binding site receptivity. Also combinations of different 
interferons are encompassed in the invention 
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One of the most preferred embodiments of the present invention is the use of 

interferon as a prophylactic treatment for the prevention of coronavirus infection. 
Subjecting apes to a prophylactic or therapeutic treatment either before or during 
infection with the cornoavirus has a good and useful predictionary value for 
application of such a prophylaxis or therapy in human subjects. 

As is shown in the experimental section administration of interferon before 
infestation with virus particles greatly delays infection and the effects after infection. 
It should be understood that the virus challenge given to the test animals is a high 
dose, which will not or hardly ever occur in 'natural' infections. It is understood that 
viral challenge under 'natural' circumstances would equate with a challenge of about 
10-105 TCID 50 with a concentration which is much less than that used in the 
experiment of the invention. Further, the viral challenge in the experiment was for 
the largest part applied intra-tracheal, i.e. at the place where the virus exerts its 
main infective activity. Normally, a virus will be encountered in the air that is 
breathed and this air will firstly pass the nose and/or oral cavity, where it will have a 
large chance of being filtered out (and stopped) by the epithelium and mucosa of the 
mouth and/or the nose. Anyhow, the fact that even at the extremely high dose used in 
our experiments we have been able to show effect of interferon indicates that the 
effect will even be more pronounced at infective viral doses which are normally 
encountered. It is therefore believed that prophylactic administration will give a 
durable, and strong protection against infection with coronaviruses. 

This is especially important in relation to viruses which are highly infective 
and/or which have an airborne mode of transmission, such as, for instance, the SARS 
virus. A prophylactic treatment would be especially welcome for people who run a 
risk of being infected, such as, in the case of SAES virus, hospital personnel, children, 
elderly and people having an underlying condition such as diabetes or heart disease, 
or a weakened immune system. 

It remains possible; that SARS-CoV infection might be asymptomatic in some 
people, or cause nonrespiratory symptoms in others. There is insufficient evidence to 
exclude the possibility that asymptomatic, or atypical, infected people can transmit 
the disease. Thus a prophylactic treatment for the prevention of coronavirus 
infection, like SARS-CoVis indeed essential. 
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However, our data also show that interferon is also applicable for therapy of 
coronaviruses, i.e. at the time when virus infection is already established. Our in vivo 
data show that pathologic effects are at least delayed upon administration of 
interferon. 

Interferon of human and murine origins has been quantified in the art in 
terms of International Units ("IU"). As used herein, a "unit" of interferon (to be 
distinguished from "IU") shall mean the reciprocal of a dilution of interferon- 
containing material that, as determined by assay, inhibits one-half the number of 
plaques of a challenge virus, the challenge virus being the vesicular stomatitis virus 
("VSV"). So quantified a "unit" of interferon is routinely found to be about one-tenth 
the quantity of interferon represented by one "IU. " Alternatively, interferon can be 
quamtitated in ug/kgof body weight. 

Interferon is given in doses ranging from l|ig/kg to 3ug/kg. When the 
interferon is pegylated doses can be delivered less frequently. Treatment of a 
coronavirus disease in accordance with the present invention comprises 
administering pegylated interferon at a dosage of 0.01-6u.g/kg per day in a dosage 
form adapted to promote contact of said dosage of interferon with the oral and 
pharyngeal mucosa of said animal. Preferably, the dosage of interferon is from 0.1- 
4ug/kg per day, more preferably 0.3-3|.ig/kg per day. 

Interferon may be administered by any available means, including but not 
limited to, oral, intravenous, intramuscular, pulmonary and nasal routes, and 
wherein said composition is present as a solution, a suspension or an aerosol spray, 
especially of fine particles. 

It is critical that the pegylated interferon be administered in a dosage form 
adapted to assure maximum contact of the interferon in said dosage form with the 
oral and pharyngeal mucosa of the human or animal, undergoing treatment. Contact 
of interferon with the mucosa can be enhanced by maximizing residence time of the 
treatment solution in the oral or pharyngeal cavity. Thus, best results seem to be 
achieved in human patients when' the patient is requested to hold said solution of 
interferon in the mouth for a period of time. Contact of interferon with the oral and 
pharyngeal mucosa and thereafter with the lymphatic system of the treated human 
or animal avian, rodent is unquestionably the most efficient method administering 
immunotherapeutic amounts of pegylated interferon. 
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For example interferon can be administered in either a liquid (solution) or 
solid dosage form. Thus interferon can be administered dissolved in a buffered 
aqueous solution typically containing a stabilizing amount (1-5% by weight) of blood 
serums. Exemplary of a buffered solution suitable as a carrier of interferon 
administered in accordance with this invention is phosphate buffered saline prepared 
by standard techniques. 

It is also contemplated by the present invention to provide interferon in a solid 
dosage form such as a lozenge adapted to be dissolved upon contact with saliva in the 
mouth with or without the assistance of chewing. Such a unitary dosage form is 
formulated to release about 1 to about 1500 IU of interferon upon dissolution in the 
mouth for contact with the oral and pharyngeal mucosa. Thus a unitary dosage form 
of interferon in accordance with this invention can be prepared by art-recognized 
techniques for forming compressed tablets such as chewable vitamins. Similarly, 
interferon can be incorporated into starch-based gel formulations to form a lozenge 
which will dissolve and release interferon for contact with the oral mucosa when held 
in the mouth. Solid unitary dosage forms of interferon for use in accordance with the 
present invention can be prepared utilizing art recognized dosage formulation 
techniques. The pH of such formulations can range from about 4 to about 8.5. Of 
course, in processing to such unitary dosage forms one should avoid heating a pre- 
dosage form formulation, after addition of interferon, above about 50°C. Exemplary of 
a solid dosage form for animal use is a molasses block containing effective amounts of 
interferon. 

Alternatively the interferon can be formulated into flavoured or unfavoured solutions 
or syrups using a buffered aqueous solution of interferon as a base with added caloric 
or non-caloric sweeteners, flavour oils and pharmaceutically acceptable 
surfactant/dispersants. 

Also contemplated are methods of gene therapy capable of causing expression 
of interferon in respiratory or gastric cells for prevention of a coronaviral infection. 

Of course, the clinical use of any medicament of the present. invention is a 
clinical decision to be made by the clinician and the exact course of such treatment is 
left to the clinician's sound discretion, with all svtch courses of treatment deemed 
within the bounds of the present invention. 

Another preferred embodiment is administration of interferon together with 
another treatment which is directed to prevent or treat infection with coronaviruses. 
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Such other treatment can for instance be a vaccine, antibody and/or anti- viral agent 
selected from the group consisting of whole inactivated virus vaccines, attenuated 
vaccines, sub-unit vaccines, recombinant vaccines, antibody for passive 
immunization, nucleoside analogs such as ribavirin, RNA-dependent RNA 
5 polymerase inhibitors and protease inhibitors. 

Use of interferon together with administration of a vaccine will boost the 
effects of the vaccine. First of all, there is the combination of treatments that will add 
up to a better effect. However, co-administration of interferon with a vaccine will also 
enable the immune response to vaccination to have more effect. Normally the 

10 immune response is slow and it takes a few days to come to a high enough titer of 
antibodies to be able to effectively combat virus particles. When no interferon is co- 
administered the virus would have had the chance to multiply to enormous amounts, 
which cannot be overcome by the immune response. With interferon, however, the 
amounts of the virus will remain absent or low and any infective virus outburst (if 

15 any at all) can easily be handled by the immune system. 

Treatment to prevent and/or treat infection with coronaviruses can also 
comprise combination treatments with other antiviral compounds, such as, for 
instance, nucleoside-based compounds such as ribavirin (e.g. Rebetol® (ribavirin, 
USP). These compounds act through interfering with the viral replication by 

2 0 presenting nucleosides which are built in during viral replication, but which either 

prevent formation of viral proteins or which do not yield functional proteins. Co- 
administration of interferon will even more slow down viral replication. 
Combinations of ribavirin and forms of interferon can help to reduce viral load. 
Another disease condition responding to treatment in accordance with the 
25 present invention is neoplastic disease. Thus, the administration of interferon in 

accordance with the above description can, alone or in combination with other drugs 
or therapy, help effect remission of cancers such as malignant lymphoma, melanoma, 
mesothelioma, Burkitt lymphoma and nasopharyngeal carcinoma and.other 
neoplastic diseases, especially those of known or suspected viral etiology and diseases 

3 0 such as Hodgkin's Disease and leukemia. 

Other disease conditions responding to treatment in accordance with the 
present invention are infectious diseases of coronaviral origin in human, avian, 
porcine, canine and feline species. Human coronavirises are coronivixus 229E and the 
newly discovered coronavirus HcoV-NL (see European patent.applicatipn 03078772.5. 
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Several other coronaviruses can cause fatal systemic diseases in animals, including 
feline infectious peritonitis virus (FIPV), hemagglutinating encephalomyelitis virus 
(HEV) of swine, and some strains of avian infectious bronchitis virus (IBV) and 
mouse hepatitis virus (MHV). These coronaviruses can replicate in liver, lung, 
kidney, gut, spleen, brain, spinal cord, retina, and other tissues. Immunopathology 
plays a role in tissue damage in MHV and FIPV, and cytokines are responsible for 
some signs of disease. Significantly, in cats with persistent, inapparent infection with 
feline enterotropic coronavirus, virulent virus mutants can arise and cause fatal 
infectious peritonitis, a systemic disease. 

The invention also comprises an animal model usable for testing of 
prophylactic and/or therapeutic methods and/or preparations. It has appeared that 
apes can be infected with the SARS virus, thereby showing clinical symptoms, and 
more importantly, similar tissue morphology as found in humans suffering from 
atypical pneumonia caused by the SAKS virus. Subjecting apes to a prophylactic or 
therapeutic treatment either before or during infection with the virus will have a 
good and useful predictionary value for application of such a prophylaxis or therapy 
in human subjects. 

The invention is further explained in the Examples without limiting it thereto. 



WO 2004/089983 PCT/NL2004/000229 



Figure legends 

Fig. 1: Phylogenetic relationship for the nucleotide sequences of isolate HK39849 
with its closest relatives genetically. Phylogenetic trees were generated by maximum 
5 likelihood analyses using 100 bootstraps and 3 jumbles. The scale representing the 
number of nucleotide changes is shown for each tree. 



Fig. 2: Nucleotide sequences from 13 clones of parts of the SARS virus. Also included 
are the putative polypeptide sequences of polypeptides and alignments of the putative 
1 0 polypeptides with that of another member of the Coronoviridae family, where 
possible. 

Fig. 3: Schematic map of the SAES virus genome, indicating the position of the 

nucleotide sequences of figure 2 relative to the genome and a putative indication of 
15 the open reading frames of the genome based on analogy with other coronaviruses. 

The gene structure for the region between the Spike and Nucleocapsid is uncertain. 

EMC1-EMC14 and RDG 1: sequences as provided in figure 2. CDC and BINl-2: 

sequences were provided through personal communication from the CDC (Dr. W. 

Bellini, Centers for Disease Control & Prevention, National Centers for Infectious 
2 0 Diseases, 1600 Clifton Road, Atlanta GA 30333, USA) and BNI (Dr. C . Drosten and 

Prof. Dr. H. Schmitz, Bernard Nocht Institute, Bernard-Nocht Str. 74, D-20359 

Hamburg, Germany), respectively. 

Fig. 4: Amino acid comparison of the N-terminus of the S-protein of the SARS virus 
25 and closely related coronaviruses. HCV OC43 = human coronavirus isolate OC43; 
MHV A59 = murine hepatitis virus isolate A59, BCV = bovine corona virus. 



Fig. 5: Negative contrast EM photograph of SARS virus obtained from concentrated 
supernatant of infected cell cultures. 

Fig. 6: Infection with SARS-coronavirus causes pulmonary and renal lesions in 
cynomolgus macaques. FormaUn-fixed, paraffin-embedded tissues were stained with 
haematoxylin and eosin and examined by light microscopy. There is diffuse alveolar 
damage of the lung (a), and the alveolar lumina (b) are flooded with highly 
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proteinaceous exudate admixed with inflammatory cells and cellular debris. In the 
lumen of a bronchiole (c) and in the surrounding lung parenchyma are several 
multinucleated syncytial cells (arrowheads). The renal collecting tubules (d) contain 
similar multinucleated syncytial cells. Original magnifications: a x 12.5; b x 50; c x 
5 100; d x 250. 

Fig 7: Infection of domestic cats and ferrets with SCV. Cats (A, n=6) and ferrets (B, 
n=6) were infected with 10 6 TCIDso via the respiratory route and secretion of SCV in 
pharyngeal swabs was quantified by real time PGR. Four animals per group were 
1 0 euthanised at day 4 while the other two were analysed till day 28. SCV secretion in 
non-infected cats (C, n=2) and non-infected ferrets (D, n=2) exposed to SCV infected 
cats and ferrets, respectively. Real time PCR results are shown relative to a titrated 
SCV standard and shown as TCIDso/ml (N.D. not done). 

1 5 Fig. 8: Detection of SCV in postmortem tissues of experimentally SCV infected cats 
and ferrets 

Fig. 9: Effect of pegylated IFN-a on SARS Coronavdrus (SCV) replication in 
2 0 macaques. SCV detection in pharyngeal swabs (days 0, 2 and 4 after infection, closed 
bars) and lungs (day 4, open bars) taken from cynomolgus monkeys treated with PBS 
(A), PEG-Intron at days -3, -1, +1 and +3 (B and C) and PEG-Intron at days +1 and 
+3 (D) after SCV infection. Individual macaques are shown (n=2 per group). Virus 
isolation (VT) results are indicated in the lower part of the panel whereas real time 

2 5 PCR results are shown in the upper part of the panels (n.a., not available). 

Fig. 10 Nucleotide sequence of SARS Corona virus Genbank accession nr. AY274119 

Fig. 1 1 Antiviral activity of pegylated IFN-a against SCV in vitro and its biological 

3 0 activity in cynomolgus macaques, (a) Effect of pegylated IFN-a against SCV infection 

in vitro. Similar results were obtained in 3 separate experiments, (b) 
Pharmacokinetic analysis of pegylated IFN-a in macaques treated with PBS (control 
group; open squares, n = 4) or pegylated IFN-a (prophylactic group; closed squares, n 
= 4) at days -3 and -1. (c) Induction of neopterin in macaques treated with PBS 
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(control group; open squares, n = 4) or pegylated IFN-a (prophylactic group; closed 
squares, n = 4) at days -3 and -1. Data are expressed as mean ± s.d.; **, P <0.01 
versus, control. 

5 Fig. 12 Effect of pegylated IFN-a on SCV excretion in cynomolgus macaques. SCV 
detection in pharyngeal swabs taken at 0, 2, or 4 d.p.i. from macaques treated with 
PBS (control group, n = 4), pegylated IFN-a prophylactically (n = 6) or post-exposure 
(n =4). Data are expressed as mean ± s.d.; * P <0.05 versus control group at 2 d.p.i., 
**, P.<0.01 versus control group at 2 d.p.i.. 

10 

Fig. 13 Effect of pegylated IFN-a on SCV replication, viral antigen expression and 
histological lesions in the lungs of SCV-infected cynomolgus macaques, (a) SCV 
titration of lung homogenates. (b) Immunohistochemical detection of SCV-infected 
cells in lung sections, (c) Histopathological score of lung sections. SCV-infected 
1 5 macaques were treated with pegylated IFN-a prophylactically (n = 4) or post- 
exposure (n = 4), or treated with PBS (control group, n = 4). Data are expressed as 
mean ± s.d.; *, P <0.05 versus control group; **, P <0.01 versus control group. 
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Examples 

Example 1. Virus isolation and characterisation 

Isolation 

5 Isolate HK39849 was isolated from a hospitalised SARS patient by throat 

swab and inoculated into a culture of Verp-E6 cells. A sample of the supernatant from 
these infected cells provided by Dr. M. Peiris (Queeen Mary Hospital Faculty of 
Medicine, Hong Kong University, Honk Kong) was used to inoculate VERO-118 cells 
and cell culture supernatant from these cells was aliquoted and frozen after one 
10 passage. 

We isolated RNA from the virus-containing cell culture supernatant and 
subjected it to RNA arbitrarily primed PCR (RAP-PCR) essentially as described by 
Welsh & McClelland (NAR 18:7213; PNAS USA 90:10710, 1993). Virus in the culture 
supernatants was purified on continuous 20-60% sucrose gradients. The gradient 

15 fractions were inspected for virus-like particles by EM, and RNA was isolated from 
the fraction containing , in which the most nucleocapsids were observed. Equivalent 
amounts of RNA isolated from virus fractions were used for RAP-PCR, after which 
samples were run side by side on a 3% NuSieve agarose gel. Differentially displayed 
bands ranging in size from 200-1500 base pairs specific for the unidentified virus 

2 0 were subsequently purified from the gel, cloned in plasmid pCR2.1 (Invitrogen) and 
sequenced with vector-specific primers. When we used these sequences to search for 
homologies against sequences in the Genbank database using the BLAST software 
(www.ncbi.nlm.nih.gov/BLAST/) which yielded resemblance to virus sequences of the 
coronaviruses displayed in the phylogenetic tree of figure 1. 

2 5 Eight of these fragments (EMC 1-6, 13 and 14) were located in the ORF coding for the 

viral polymerase (ORF lab), one (EMC-7) spanned the 3 1 end of ORFlab and reached 
into the 5' end of spike protein region; EMC-10 overlapped the 3' end of EMC-7 and 
therefore also codes part of the S protein region and EMC 9 encodes a region 
downstream of EMC-10; by use of primers to sequences within EMC 10 and EMC9 

3 0 (see below), the region between these two sequences was amplified by PCR and 

sequenced. The full contiguous region has been incoporated into EMC7 in firgure 2;a 
further sequence (RDG1 in figure 2) encodes the 3' end of the Spike protein. A further 
sequence (EMC8) spanned part of the Nucleocapsid coding sequence. The remaining 
three sequences (EMC9, 11 and 12) have in the meantime been found to be regions of 
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the orf lab/replicase, where emc 9 is incorporated in emc 11. This has not yet been 
reflected in figure 3. 

Phylogeny 

BLAST searches using nucleotide sequences obtained from the unidentified 
virus isolate revealed homologies primarily with members of the Coronaviridae. As 
an indication for the relation between the newly identified virus isolate and other 
coronaviruses a phylogenetic tree was constructed based on the sequence information 
obtained (figure 1). 

Materials and Methods 

Specimen collection 

Virus was collected from SAES patients using throat swabs and from 
experimentally infected monkeys (throat and nasal swabs, serum, plasma and faeces) 

Virus isolation and culture 

Throat swabs were dipped into a culture of Vero-E6 cells and incubated for 1-4 
days. Cell culture supernatant was clarified by centrifugation and filtered through a 
0.45micrometre niter, before beings stored frozen. The virus was subsequently 
propagated in Vero-118 cells. 

Antigen detection by indirect IFA 

Samples from experimentally infected monkeys was cultured on Vero-118 cells 
in 24 well plates containing glass slides. These glass slides were washed with PBS 
and fixed in aceton for 1 minute at room temperature. After washing with PBS the 
slides were incubated for 30 minutes at 37 °C with SAES-antibody containing serum 
from SARS patients. After washing off the human serum in PBS, the slides were 
incubated at 37°C for 30 minutes with PITC labeled anti-human antibodies. After 
three washes in PBS and one in tap water, the slides were included in a glycerol/PBS 
solution (Citifluor, UKC, Canterbury, UK) and covered. The slides were analysed 
using an Axioscop fluorescence microscope (Carl Zeiss B.V., Weesp, the Netherlands). 



Detection of antibodies in humans by indirect IFA 
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Virus was cultured on Vero-118 cells in 24 well plates containing glass slides. 
These glass slides were washed with PBS and fixed in aceton for 1 minute at room 
temperature. After washing with PBS the slides were incubated for 30 minutes at 37 
°C with SARS-antibody containing serum from SARS patients. After washing off the 
human serum in PBS, the slides were incubated at 37°C for 30 minutes with FITC 
labeled anti-human antibodies. After three washes in PBS and one in tap water, the 
slides were included in a glycerol/PBS solution (Citifluor, UKC, Canterbury, UK) and 
covered. The slides were analysed using an Axioscop fluorescence microscope (Carl 
Zeiss' B.V., Weesp, the Netherlands 

Detection of antibodies in humans byELISA 
Patient samples. 

4 samples of patients with SARS disease , 8 samples of patients from routine 
serological virology; samples from an experimentally infected monkey (preserum, 9 
and 12 days after infection ). 

The Conjugate. 

Whole virus was used as the conjugate.. Tissue culture supernatant from 
infected Vero cells were pelleted through 20% sucrose onto a 60% sucrose cushion. 
The virus was then pelleted through 20% sucrose and resuspended in PBS/1% NP40. 
After dialysis using PBS, the virus was The conjugated to horseradish peroxidase by 
standard techniques was tested in 3 concentrations (diluted in dilution buffer 9000- 
03, 1:100, 1:400 and 1:1600), both on polyvalent anti-IgM code MCB0201 (cross- 
reactive with monkey) and monoclonal anti-IgM, code 9000-62 (non-crossreactive 
with monkey). 

Sera were diluted 1:200 in serum diluent (code 9000-03), monkey 775 was 
diluted 1: 100, 1:200 and 1:400. 

Serum incubation one hour at 37°C, conjugate incubation one hour at 37°C, 
and TMB (ready to use): 30 minutes at room temperature. The reaction was stopped 
with sulphuric acid (0.5M). 
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Virus characterisation 

For EM analyses, virus was concentrated from infected cell culture 
supernatants in a micro-centrifuge at 4 °C at 17000 x g, after which the pellet was 
5 resuspended in PBS and inspected by negative contrast EM 

RNA isolation 

. RNA was isolated from the supernatant of infected cell cultures or sucrose 
1 0 gradient fractions using a High Pure RNA Isolation kit according to instructions from 
the manufacturer (Roche Diagnostics, Almere, The Netherlands). 

RT-PCR 

A one-step RT-PCR was performed in 50 (.il reactions containing 50 mM 
15 Tris.HClpH 8.5, 50 mM NaCl, 4 mM MgC12, 2 mM dithiotreitol, 200 uM each dNTP, 
10 units recombinant RNAsin (Promega, Leiden, the Netherlands), 10 units AMV RT 
(Promega, Leiden, The Netherlands), 5 units Amphtaq Gold DNA polymerase (PE 
Biosystems, Nieuwerkerk aan de Ijssel, The Netherlands) and 5 ul RNA. Cycling 
conditions were 45 min. at 42 °C and 7 min. at 95 °C once, 1 min at 95 °C, 2 min. at 

2 0 42 °C and 3 min. at 72 °C repeated 40 times and 10 min. at 72 °C once. 

Primers used for diagnostic PCR: 
SARSfwd2: ggtggaacatcatccggtgat 
SARS rev2: agcctgtgttgtagattgcgg 

These primers amplify a 149bp fragment of the polymerase gene (orf lab) 
25 RF999: TTTAAACACTTACGAGAGTTTGTG 
RF997: GGACACAACCCATGAAATCATCTGG 

These primers amplify a region of 728bp in the spike glycoprotein gene (S) 
RF998: AGACATATCTAATGTGCCTTTCTCC RF1002: 

AAGCTCGTCACCTAAGTCATAAGAC (from EMC 11 sequence) 

3 0 The combination of RF998/RF1002 primers enabled us to sequence the 3' end 

of EMC7 - RF998 is a specific primer withing EMC7 whereas EMC1002 acted as a 
random primer. 

RT-PCR, gel purification and direct sequencing were performed as described 

above. 
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RAP-PCR 

RAP-PCR was performed essentially as described by Welsh & McClelland (Nuc. Acid 
5 Res. 18:7213, 1990; Proc. Natl. Acad. Sci. USA 90:10710 1993) . The ohgonucleotide 
sequences are described in addenda 2. For the RT reaction, 2 ul RNA was used in a 
10 ul reaction containing 10 ng/ul ohgonucleotide, 10 mM dithiotreitol, 500 am each 
dNTP, 25 mM Tris-HCl pH 8.3, 75 mM KC1 and 3 mM MgC12. The reaction mixture 
was incubated for 5 min. at 70 °C and 5 min. at 37 °C, after which 200 units 

10 Superscript RT enzyme (LifeTechnologies) were added. The incubation at 37 °C was 
continued for 55 min. and the reaction terminated by a 5 min. incubation at 72 °C. 
The RT mixture was diluted to give a 50 ul PCR reaction containing 8 ng/ul 
ohgonucleotide, 300 urn each dNTP, 15 mM Tris-HCL pH 8.3, 65 mM KC1, 3.0 mM 
MgCL.2 and 5 units Taq DNA polymerase (PE Biosystems). Cycling conditions were 5 

15 min. at 94 °C, 5 min. at 40 °C and 1 min. at 72 °C once, followed by 1 min. at 94 °C, 2 
min. at 56 °C and 1 min. at 72 °C repeated 40 times and 5 min. at 72°C once. After 
RAP-PCR, 15 (4,1 the RT-PCR products were run side by side on a 3% NuSieve agarose 
gel (FMC BioProducts, Heerhugowaard, The Netherlands). Differentially displayed 
fragments were purified from the gel with Qiaquick Gel Extraction kit (Qiagen, 

2 0 Leusden, The Netherlands) and cloned in pCR2.1 vector (Invitrogen, Groningen, The 
Netherlands) according to instructions from the manufacterer. 

Sequence analysis 

RAP-PCR products cloned in vector pCR2.1 (Invitrogen) were sequenced with M13- 

2 5 specific oligonucleotides. DNA fragments obtained by RT-PCR were purified from 

agarose gels using Qiaquick Gel Extraction kit (Qiagen, Leusden, The Netherlands), 
and sequenced directly with the same oligonucleotides used for PCR, Sequence 
analyses were performed using a Dyenamic ET terminator sequencing kit 
(Amersham Pharmacia Biotech, Roosendaal, The Netherlands) and an ABI 373 

3 0 automatic DNA sequencer (PE Biosystem). All techniques were performed according 

to the instructions of the manufacturer. 

RT-PCR for diagnosing SARS uiras. 

For the amplification of the SARS virus' genetic material, we used primers: 
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SARS fwd2: ggtggaacatcatccggtgat 
SARS rev2: agcctgtgttgtagattgcgg 

These primers amplify a 149bp fragment of the polymerase gene (orf lab) 
RF 999: TTTAAACACTTACGAGAGTTTGTG 
5 RF997: GGACACAACCCATGAAATCATCTGG 

These primers amplify a region of 728bp in the spike glycoprotein gene (S). 

These primers amplify a 149bp fragment of the polymerase gene (orf lab) 
RT-PCR, gel purification and direct sequencing were performed as described above. 

10 

Phylogenetic analyses 

For all phylogenetic trees, DNA sequences were alhgned using the ClustalW software 
package and maximum likelihood trees were generated using the DNA-ML software 
15 package of the Phylip 3.5 program using 100 bootstraps and 3 jumbles 15 . Previously 
published sequences for TGEV, PEDV, 229E, AIBV, BoCo and MHV that were used 
for the generation of phylogenetic trees are available from Genbank 

Example 2: Methods to identify SARS virus 
2 0 Specimen collection 

In order to find virus isolates nasopharyngeal aspirates, throat and nasal swabs, 
broncheo alveolar lavages, serum and plasma samples, and stools preferably from 
mammals such as humans, carnivores (dogs, cats, musteUits, seals etc.), horses, 

2 5 ruminants (cattle, sheep, goats etc.), pigs, rabbits, birds (poultry, ostriches, etc) 

should be examined. From birds cloaca swabs and droppings can be examined as well. 
Sera should be collected for immunological assays, such as ELISA, molecular-based 
assays, such as RT-PCR and virus neutralisation assays. 
Collected virus specimens were diluted with 5 ml Dulbecco MEM medium 

3 0 (BioWhittaker, Walkersville, MD) and thoroughly mixed on a Vortex mixer for one 

minute. The suspension was thus centrifuged for ten minutes at 840 x g. The 
sediment was spread on a multispot slide (Nutacon, Leimuiden, The Netherlands) for 
immunofluorescence techniques, and the supernatant was used for virus isolation. 
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Virus isolation 



For virus isolation Vero- 118 cells or tMK cells (FJVM, Bilthoven, The Netherlands) 
were cultured in 24 well plates containing glass shdes (Costar, Cambridge, UK), with 
5 the medium described below supplemented with 10% fetal bovine serum 

(BioWhittaker, Vervier, Belgium). Before inoculation the plates were washed with 
PBS and supplied with Eagle's MEM with Hanks' salt (ICN, Costa mesa, CA) 
supplemented with 0.52/hter gram NaHCOs , 0.025 M Hepes (Biowhittaker), 2 mM L- 
glutainine (Biowhittaker), 200 units/liter penicilhne, 200 ug/liter streptomycine 

1 0 (Biowhittaker), lgram/liter lactalbumine (Sigma-Aldrich, Zwijndrecht, The 

Netherlands), 2.0 gram/liter D-glucose (Merck, Amsterdam, The Netherlands), 10 
gram/liter peptone (Oxoid, Haarlem, The Netherlands) and 0.02% trypsine (Life 
Technologies, Bethesda, MD). The plates were inoculated with supernatant of the 
patient samples, 0,2 ml per well in triplicate, followed by centrifuging at 840x g for 

15 one hour. After inoculation the plates were incubated at 37 °C for a maximum of 1-3 
days and cultures were checked daily for CPE. Extensive CPE was generally observed 
within 24hours. and included detachment of cells from the monolayer.. 

Virus culture of SARS 
2 0 Sub-confluent monolayers of tMK cells or Vero clone 118 cells in media as 

described above were inoculated with supernatants of samples that displayed CPE or 
with samples taken from patient or artificially infected monkeys.. 

Virus characterisation 

2 5 For EM analyses, virus was concentrated from infected cell culture 

supernatants in a micro-centrifuge at 4 °C at 17000 x g, after which the pellet was 
resuspended in PBS and inspected by negative contrast EM. 
Antigen detection by indirect IFA 

3 0 Virus was cultured on Vero- 118 cells in 24 well shdes containing glass shdes. These 

glass shdes were washed with PBS and fixed in aceton for 1 minute at room 
temperature. 

After washing with PBS the shdes were incubated for 30 minutes at 37 °C with SAES 
patient serum. We used patient serum, but antibodies can be raised in various 
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animals, such as ferrets, goats and rabbits (for polyclonal antibodies) and mice and 
hamsters (for monoclonal antibodies), and the working dilution of the antibody can 
vary for each immunisation. After three washes with PBS and one wash with tap 
water, the slides were incubated at 37°C for 30 minutes with FITC labeled goat-anti- 
5 human antibodies. After three washes in PBS and one in tap water, the slides were 
included in a glycerol/PBS solution (CitifLuor, UKC, Canterbury, UK) and covered. 
The slides were analysed using an Axioscop fluorescence microscope (Carl Zeiss B.V., 
Weesp, the Netherlands). 

10 Detection of antibodies in humans by indirect TEA 

For the detection of virus specific antibodies, SAKS virus-infected Vero cells 
were fixed with acetone on coverslips (as described above), washed with PBS and 
incubated 30 minutes at 37°C with serum samples at a 1 to 16 dilution. After two 
washes with PBS and one with tap water, the slides were incubated 30 minutes at 

1 5 37°C with FITC-labelled secondary antibodies to human antibodies (Dako). Slides 
were processed as described above. 

Antibodies can be labelled directly with a fluorescent dye, which will result in a direct 
immuno fluorescence assay. FITC can be replaced with any fluorescent dye. This 
technique can be applied to antibodies in other animals such as mammals, 

2 0 ruminants, birds or other species, assuming the secondary antibody to the 

appropriate species is used. 

Detection of antibodies in humans by ELISA 
Patient samples. 

25 4 samples of patients with SABS; 8 samples of patients from routine 

serological virology; samples from an experimentally infected monkey (preserum and 
9 days after infection) . 

The Conjugate. 

3 0 The conjugate was tested at a number of concentrations, both on polyvalent 

anti-IgM (cross-reactive with monkey) and monoclonal anti-IgM, (non-crossre active 
with monkey). 
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Sera were diluted 1:200 in serum diluent and the monkey serum was diluted 1: 100 
1:200 and 1:400. 

Serum incubation one hour at 37°C, conjugate incubation one hour at 37°C, and TMB 
(ready to use): 30 minutes at ropm temperature. The reaction was stopped with 
5 sulphuric acid (0.5M). 

Results were interpreted by eye. Three of the four SARS-IgM positive sera (as 
detected by IF on infected cells) had a higher score than negative control sera.One 
serum had a score which was also reached by some of the negative controls. The 9 day 
10 old monkey sera did not react, but the 12 day old did. Thus, this study shows that 

with direct conjugation of nucleocapsids the developemnt of an IgM capture method is 
feasable. 

Furthermore, this type of assay can be performed in a number of formats by those 
trained in the art. The assay can be extended to the detection of IgA and IgG 
15 antibodies from humans and animals and can make use of different capture antigens, 

such as, but not limited to, purified recombinant N protein. 

Animal immunisation 

Cynomologous macaque specific antisera for the newly discovered virus were 
2 0 generated by experimental intratrachael installation of cultured virus of 

Cynomologous macaques. One to two weeks later the animals were bled. The sera 
were tested for reactivity to SARS virus by indirect IFA as described above; 
uninfected control cells were used to ensure the specificity of the serum. Other 
animal species are also suitable for the generation of specific antibody preparations 

2 5 and other antigen preparations may be used. 

RNA isolation 

RNA was isolated from the supernatant of infected cell cultures or sucrose 

3 0 gradient fractions using a High Pure RNA Isolation kit according to instructions from 

the manufacturer (Roche Diagnostics, Almere, The Netherlands). RNA can also be 
isolated following other procedures known in the field (Current Protocols in Molecular 
Biology). 
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RT-PCR 

A one-step RT-PCR was performed in 50 ul reactions containing 50 mM Tris.HCl pH 
8.5, 50 mM NaCl, 4 mM MgCl 2j 2 mM dithiotreitol, 200 uM each dNTP, 10 units 
recombinant RNAsin (Promega, Leiden, the Netherlands), 10 units AMV RT 
5 (Promega, Leiden, The Netherlands), 5 units Amplitaq Gold DNA polymerase (PE 
Biosystems, Nieuwerkerk aan de Ijssel, The Netherlands) and 5 ul RNA. Cychng 
conditions were 45 min. at 42 °C and 7 min. at 95 °C once, 1 min at 95 °C, 2 min. at 
42 °C and 3 min. at 72 °C repeated 40 times and 10 min. at 72 °C once. 
Primers used for diagnostic PCR: 

10 

For the amplification of the SARS virus' genetic material, we used primers: 
SARS fwd2: ggtggaacatcatccggtgat 
SARS rev2: agcctgtgttgtagattgcgg 

These primers amplify a 149bp fragment of the polymerase gene (orf lab) 
1 5 RT-PCR, gel purification and direct sequencing were performed as described above. 

Sequence analysis 

Sequence analyses were performed using a Dyenamic ET terminator 
sequencing kit (Amersham Pharmacia Biotech, Roosertdaal, The Netherlands) and an 
2 0 ABI 373 automatic DNA sequencer (PE Biosystem). All techniques were performed 
according to the instructions of the manufacturer. PCR fragments were sequenced 
directly with the same oligonucleotides used for PCR, or the fragments were purified 
from the gel with Qiaquick Gel Extraction kit (Qiagen, Leusden, The Netherlands) 
and cloned in pCR2.1 vector (Invitrogen, Groningen, The Netherlands) according to 

2 5 instructions from the manufacturer and subsequently sequenced with M13-specific 

oligonucleotides. 

Detection of antibodies in humans, mammals, ruminants or other animals by ELISA 

3 0 A recombinant protein derived from the SARS virus is preferred as the 

antigen. However, purified nucleocapsids may also be used. Antigens suitable for 
antibody detection include any SARS protein that combines with any SARS-specific 
antibody of a patient exposed to or infected with SARS virus. Preferred antigens of 
the invention include those that predominantly engender the immune response in 
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patients exposed to SAES, which therefore, typically are recognised most readily by 

antibodies of a patient. Particularly preferred antigens include the N, and S proteins 
ofSARS. 

Antigens used for immunological techniques can be native antigens or can be 
modified versions thereof. Well known techniques of molecular biology can be used to 
alter the amino acid sequence of a SARS antigen to produce modified versions of the 
antigen that may be used in immunologic techniques. 

Methods for cloning genes, for manipulating the genes to and from expression 
vectors, and for expressing the protein encoded by the gene in a heterologous host are 
well-known, and these techniques can be used to provide the expression vectors, host 
cells, and the for expressing cloned genes encoding antigens in a host to produce 
recombinant antigens for use in diagnostic assays. See for instance: Molecular 
cloning, A laboratory manual and Current Protocols in Molecular Biology. 

A variety of expression systems may be used to produce SARS antigens. For 
instance, a variety of expression vectors suitable to produce proteins in E.Coli, 
B.subtilis, yeast, insect cells and mammalian cells have been described, any of which 
might be used to produce a SARS antigen suitable to detect anti- SARS antibodies in 
exposed patients. 

The baculovirus expression system has the advantage of providing necessary 
processing of proteins, and is therefor preferred. The system utilizes the polyhedrin 
promoter to direct expression of SARS antigens. (Matsuura et al. 1987, j.Gen.Virol. 
68: 1233-1250). 

Antigens produced by recombinant baculo-viruses can be used in a variety of 
immunological assays to detect anti- SARS antibodies in a patient. It is well 
established, that recombinant antigens can be used in place of natural virus in 
practically any immunological assay for detection of virus specific antibodies. 
The assays include direct and indirect assays, sandwich assays, solid phase assays 
such as those using plates or beads among others, and liquid phase assays. Assays 
suitable include those that use primary and secondary antibodies, and those that use 
antibody binding reagents such as protein A. Moreover, a variety of detection 
methods can be used in the invention, including colorimetric, fluorescent, 
phosphorescent, chemiluminescent, luminescent and radioactive methods. 
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Example 3: Animal models 

Macaques 

Four Cynomologous Macaques were infected with SAKS virus by intratrachaeal 
installation using Vero-118 cell derived virus. 
5 - 
The monkeys had the following clinical symptoms 
o Lethargy 

o One of four monkeys had severe pneumonia 
<o ' Mild to severe rash in the inguinal region and the axilar region 
10 • Watery stools 

After 10-16 days the monkeys were euthanized. Tissues were examined and the 
following was found 

• Alveolae were filled with serum and their architecture were disrupted, 
1 5 consistent with bronchointestitial pneumonia (see fig 5 and b) 

• Multi-cell syncytia in lungs (fig 5c) 

• Multi-cell syncytia in kidneys (fig 5 d) 

• Widening of the small intestine 

2 0 Virus was detected using RT-PCR on tissue samples and by culturing samples 
followed by electron microscopy from 

• Lungs 

• Nasal swabs 

• Throat swabs 
25 • Faeces 

• Kidneys 

The EM results demonstrate that the virus that was recovered from the 
Cynomologous Macaques had the identical morphology to the virus which Was used to 
infect them. 

30 

This demonstrates that Cynomologous Macaques may be used as animal models to 
tests the efficacy of pharmaceutical preparations for therapeutic or prophylactic 
purposes 



1SDOCID: <WO__2004089983A2J_> 



BNSpage42 



WO 2004/089983 PC17NL2004/000229 
42 

Cats and ferrets 

Domestic cats (n = 6) and ferrets (n = 6) were inoculated intratracheally with 
106 median tissue culture infectious dose (TCID50) SCV, obtained from patient 5688 
who died from SARS and passaged four times on Vero 118 cells in vitro. Nasal, 
5 pharyngeal and rectal swabs were taken on different days post infection (d.p.i.)- Four 
animals of each group were euthanised at 4 d.p.i. and necropsy was performed 
according to a standard protocol. No clinical signs were observed in SCV-inoculated 
cats, while three out of six ferrets became lethargic from 2 to 4 d.p.i. and one of these 
ferrets died at 4 d.p.i. All cats and ferrets (Fig. 7) shed SCV from the pharynx 

10 starting at 2 d.p.i. until day 10 and 14, respectively, as determined by RT-PCR. Virus 
was isolated from all pharyngeal swabs taken on 2-8 d.p.i. and nasal swabs of two 
cats on 4 and 6 d.p.i.. SCV was detected neither in nasal swabs from ferrets nor in 
rectal swabs of cats or ferrets. Infection of the respiratory tract was evident in all 
animals tested; SCV could be isolated from their tracheas and lungs (Fig 8). 

1 5 Quantification of the mean geometric viral titres per ml lung homogenate revealed 
relatively low SCV titers in. the lungs of SCV-inoculated cats (1 x 103 ±0.51 TCID50) 
compared to ferrets (1 x 106 ±0.70 TCID50). Histologically, SCV infection was 
associated with pulmonary lesions similar to those in SCV-infected macaques, except 
that they were milder, especially in SCV infected cats and syncytia were not found. In 

2 0 the gastro-intestinal and urinary tracts SCV was detected by RT-PCR (Fig 8). Follow 

up of the remaining SCV-inoculated animals (n = 2 per group) revealed that they all 
had seroconverted by 28 d.p.i. (neutralising antibody titers 40-320). Two attempts to 
infect suckling mice through intracerebral inoculation failed. 

Non-inoculated cats (Fig 7c, n = 2) and ferrets (Fig 7d, n = 2) housed together 
25 with the inoculated cats and ferrets, respectively, became infected with SCV; viral 
titers gradually increased from day 2 onwards and peaked at 6 to 8 d.p.i. Neither of 
the cats showed chnical signs but had seroconverted by day 28 (virus neutralising 
antibody titres of 40 and 160). Both ferrets showed lethargy and conjunctivitis and 
died on 16 and 21 d.p.i. Based on pathologic examination, the main lesions in these 

3 0 two animals were marked hepatic lipidosis and emaciation. There was no evidence 

that either of these animals died of SCV-associated pneumonia, although SCV was 
isolated from postmortem lung specimens of one animal. 

In conclusion, domestic cats and ferrets are susceptible to experimental SCV 
infection and transmission of SCV to non-inoculated animals occurs efficiently. Both 
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species potentially could be used as animal models to test antiviral drugs or vaccine 
candidates against SARS. 



Example 4: S&RS- interferon experiments 

In a first experiment four groups of two monkeys were injected. 

1. PEG- INTERFERON treatment 

Dose: 3 ng/kg or PBS injected intramuscularly according the following 
scheme: 

Monkey : 

M001 PBS. at days -3, -1, +1 and +3 

M002 PBS at days -3, -1, +1 and +3 

M003 I FN at days -3, -1, +1 and +3 

M004 I FN at days -3, -1,. +1 and +3 • 

M005 PBS at d.-3 and -1 and I FN at d.+l and +3 

M006 PBS at d. -3 and -1 and I FN at d. +1 and +3 

M007 I FN at days -3, -1, +1 and +3 

MOOS I FN at days -3, -1, +1 and +3 

2. Infection 

SARS coronavirus infection of all monkeys on day 0 / 

Dose: 10 6 TCID 50 in 5 ml PBS 

o 4 ml intra-tracheal 

• 1 ml intranasal 

a 0.5 ml on each of the eyes 
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3. Sampling 

a. Nose throat and rectum swabs taken on days 0, 2 and 4 and were put 
in 1 ml transport medium. 

b . Monkeys were euthanised on day 4 and samples of lung, tracheal 
bronchial lymph node and trachea were harvested 

Virus was cultured and titrated on Vero-118 cells, and these were scored for 
cytopathic affects 

Virus titration using the three different swabs taken on days 0 , 2 and 4 after 
infection (nose, throat and rectum) and isolation of virus from the lungs, tracheal 
bronchial lymph node and trachea at day 4 after infection demonstrated that the two 
control monkeys (M001 and M002) were successfully infected (table 1). 

Table 1 SARS -associated coronavirus excretion by cynomolgus 
macaques treated with pegylated interferon. 



Pharyngeal swab Tr. Br lymph node 



M001 
M002 
M003 
MO 04 
M0Q5 
M006 
M007 
M0 08 



n.a. 
n.a. 



30 



day post infection 



1. Control animals (M001, 002) 

o Pharyngeal swabs on days 2 and 4 were all positive 
• Animal M001 also was found positive with respect to isolation of SARS 
coronavirus from the nasal swab (day 2 and 4). 
35 • No rectal swabs were positive 
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• Tissue specimen from the lungs, trachea and trachea bronchial lymph node 
from both control animals (M001 and M002) were positive at day 4 when the 
animals were sacrificed. The lung tissue homogenate contained virus at a high 
titer because the Vero cultures were found positive rapidly after inoculation. 

2. Prophylactically treated animals (M003, 004, 007 & 008) 

• negative with respect to the virus isolation test on pharyngeal swabs taken 
' at day 0, 2 and 4 after infection (table 1). 

• No nasal swab was found positive in these animals. 

• Only one rectal swab of animal M004 at day 4 was scored positive (which has 
to be confirmed in the PCR assay because these cultures showed much 
bacterial contamination (cultures of rectal swabs) 

• No virus isolated from trachea of M003 and 004. 

• Virus isolated from tracheal bronchial lymph node of M004 but not M003 

• Virus isolated from lungs of M003 and 004, but are at lower titre than controls 
as it took, longer for CPE to be observed in Vero-118 cells inoculated with 
samples from the lungs (confirmed by PCR - figure A below) 

3. Therapeutically treated animals (M005 and 006) 
SARS coronavirus 

• isolated from pharyngeal swabs taken at day 2 after infection 

• not isolated from the pharyngeal swabs taken at day 4 after infection. 

• isolated from more tissue samples and at higher titers from animal M005 and 
M006, than from animal M003 and M004 (quantitation confirmed by PCR) 

Pathological examination of lung section . stained by HE confirmed the low level 
infection of the lungs of animal M003. 

In a second experiment treatment of Cynomolgus macaques was preceded by an in 
vitro dose-finding experiemnt on Vero cells. 
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In vitro study 

Wells containing Vero cells were treated in triplicate with pegylated 
recombinant IFN-a (PEG-Intron, Shering Corp) for 16 h and infected. with 100 TCIDso 
per well of SCV, obtained from patient 5688, who died of SARS. After 16 h the 
5 supernatant was removed and cells were fixed by 10% neutral-buffered formalin and 
70% ethanol (10 min RT). SCV antigen positive cells were visualised by 
immunohistochemistry, as described under histology. The number of SCV-infected 
cells per well was summarized as mean ± s.d.. 

10 Macaque studies. 

Three groups of cynomolgus macaques were infected intratracheally with 1 x 

10 s TCIDso SCV suspended in 5 ml of phosphate buffered saline (PBS). One (the 

control group, ?i = 4) was injected intramuscularly with PBS and two (the 

prophylactic group, n = 6 ; the post-exposure group, n — 4) with pegylated IFN-a at a 

15 dose of 3 |J.g/kg. The prophylactic group was injected with pegylated IFN-a at days -3, 
-1, 1 and 3 after SCV infection and the post-exposure group at days 1 and 3 after 
SCV infection. Four macaques from each group were euthanised at day 4 after 
infection. Approval for the animal experiments had been obtained from the 
Institutional Animal Welfare Committee. At days -3, —2, —1, 0, +2 and +4, we 

2 0 anaesthetised the macaques with ketamine and collected 10 ml blood from inguinal 
veins and took pharyngeal swabs, which were placed in 1 ml transport medium 
(Fouchier, R.A. et al, J. Clin. Microbiol. 3JL 4096-5001). Pharyngeal swabs were 
frozen at -70 °C until RT-PCR analysis. Pegylated IFN-a levels were determined 
using an ELISA (Bender MedSystems Diagnostics) using PEG-Intron as a standard 

2 5 and neopterin levels were determined as described by van Gool et al. (Psychiatry Res. 

119. 125-132, 2003). Necropsies were done according to a standard protocol; one lung 
of each monkey was inflated with 10% neutral-buffered formalin by intrabronchial 
intubation and suspended in 10% neutral-buffered formalin overnight. Samples were 
collected in a standard manner (one from the cranial part of the lung, one from the 

3 0 medial and two from the caudal part), embedded in paraffin, cut at 5 urn and used for 

immunohistochemistry (see below) or stained with haematoxylin and eosin (HE). For 
semiquantitative assessment of SCV-infection-associated inflammation in the lung, 
each HE-stained section was examined for inflammatory foci by light microscopy 
using a 10x objective. Each focus was scored for size (1: smaller than or equal to area 
35 of 10x objective, 2: larger than area of 10x objective and smaller than or equal to area 
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of 2.5* objective, 3: larger than area of 2.5* objective) and severity of inflammation 
(1: mild, 2: moderate, 3: marked). The cumulative scores for the inflammatory foci 
provided the total score per animal. Sections were examined without knowledge of 
the identity of the macaques. The lung sections of one monkey in the post exposure 
5 group were not assessed because of the presence of inflammation from pre-mortem 
aspiration of food remains. Lung samples from a control group macaque were Used for 
transmission electron microscopy as described by Kuiken, T. et al. (Lancet 262, 263- 
270, 2003). 

Three lung tissue samples taken from the other lung (one from the cranial 
10 part of the lung, one from the medial part, and one from caudal part) were 

homogenised in 2 ml transport medium using Polytron PT2100 tissue grinders 
(Kinematica). After low speed centrifugation, the homogenates were frozen at -70° C 
until inoculation on Vero 118 cell cultures in 10-fold serial dilutions. The identity of 
the isolated virus was confirmed as SCV by RT-PCE of supernatant. 

15 

Immunohistochemistry 

The same formalin-fixed paraffin-embedded lung samples as used for histology 
- one from the cranial part of the lung, one from the medial part, and two from 
caudal part — were cut at 5 um, and stained for SCV antigen using a biotinylated 

2 0 purified human IgG from a convalescent SARS patient, negative control biotinylated 
purified human IgG, or the dilution buffer, as described by Kuiken et al. (supra). 
Twenty-five arbitrarily chosen 20x objective fields of lung parenchyma in each lung 
section were examined by light microscopy for the presence of SCV antigen 
expression, without knowledge of the identity of the macaques. The cumulative scores 

2 5 for each animal were expressed as number of positive fields per 100 fields (%). 

Selected lung sections from macaques in the control group were stained with anti- 
cytokeratin monoclonal antibody AE1/AE3 (Neomarkers) for identification of 
epithelial cells, according to standard immunohistochemical procedures. 



3 0 SCVRT-PCR 
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An KT-PCR with primers and probe specific for the nucleoprotein (NP) gene of 
SCV was used to quantificate SCV in swabs as described by Kuiken et al. (supra). 
Serial dilutions of the SCV stock were used as a standard and the results were 
expressed as SCV eq/ml swab medium. 



Results 

The dose finding study (3 separate experiments) on Vero cells showed a dose- 
dependent effect on the numbers of SCV infected cells per well. A significant effect 
was already observed at a dose of 1 ng/ml drug, while a dramatic reduction in the, 
10 number of infected cells was observed at doses higher than 1 ng/ml (fig. 11a). 

The control macaques showed multifocal acute DAD (diffuse alveolar damage), 
characterized by flooding of alveoli with protein-rich oedema fluid mixed with 
neutrophils and rare syncytia, extensive loss of alveolar and bronchiolar epithelium 
15 and occasional type 2 pneumocyte' hyperplasia. As indicated by 

immunohistochemistry, there was extensiveSCV antigen expression of squamous 
cells lining the alveolar walls. They were indicated as type 1 pneumocytes by their 
location, morphology, and expression of keratin in serial sections. By transmission 
electron microscopy, coronavirus-like particles measuring about 70 nm in diameter 

2 0 with typical internal nucleocapsid-like structure were found in alveolar cells. These 

cells were identified as type 1 pneumocytes because they lined the alveolar lumen, 
were closely apposed to the basement membrane, were squamous, contained 
abundant pinocytotic vesicles, and - in contrast to type 2 pneumocytes - had neither 
lamellar bodies nor microvilli. As found previously in experimentally infected 
25 macaques at 6 d.p.i., less extensive SCV antigen expression also was detected in 

hyperplastic type 2 pneumocytes within inflammatory foci. The combination of these 
histopathologic and immunohistochemical findings show that type 1 pneumocytes are 
the main target of SCV in early infection, and are associated with DAD. 

3 o High plasma levels of pegylated IFN-a after intramuscular injection into a 

group of six macaques (prophylactic group) were attained 1 day after injection (Fig. 
lib), similar to peak levels found in patients after subcutaneous injection with 3 
ug/kg pegylated IFN-a (Bukowski, R.M. et al, Cancer 9§, 389-396, 2002). Because 
IFN-a is known to activate macrophages (van Gool et al, supra), plasma levels of 
3 5 neopterin following pegylated IFN-a treatment were measured as a measure of 



ISDOCID: <WO 2004089983A2_L> 



BNS page 49 



WO 2004/089983 



49 



PCT/NL2004/000229 



macrophage activation. Neopterin levels were increased in all animals (Fig. 11c), 
confirming the biological availability of pegylated IFN-a in the treated macaques. 

To evaluate the prophylactic use of pegylated IFN-cc, we experimentally 
infected the macaques in the prophylactic group with SCV at 3 days after the start of 
5 pegylated IFN-a treatment, and compared virological and pathological parameters 
with a control group of four macaques treated with PBS instead. We limited our 
investigation to the pharyngeal swabs and the lung because an earlier study did not 
provide evidence of extensive viral replication in other organs (Kuiper et at, supra) 
.We found that all parameters were significantly reduced in the prophylactic group 

1 0 compared to the control group. By virology, virus excretion from the pharynx was 
abrogated (Fig. 12), and the virus titre in the lungs at 4 d.p.i. was significantly 
reduced (Fig- 13a). By immunohistochemistry, the expression of SCV in type 1 
pneumocytes was 90%. reduced (Fig. 13b). By pathology, the extent and severity of 
DAD was 80% reduced (Fig. 13c). These data demonstrate that prophylactic use of 

15 pegylated IFN-cc substantially, although not completely, protects type 1 pneumocytes 
of experimentally infected macaques from SCV infection, with abrogation of virus 
excretion and reduced severity of pulmonary lesions. 

To test the efficacy of pegylated IFN-a as an antiviral agent post-exposure, we 
injected pegylated IFN-a intramuscularly into a post-exposure group of four 

2 0 macaques 1 and 3 days after experimental SCV infection, and evaluated them in the 
same way as the prophylactic group. Excretion of SCV from the pharynx was found 
only on 2 d.p.i. at a significantly reduced level compared to the control group (Fig. 
. 12). Moreover, the virus titre in the lungs at 4 d.p.i. was significantly decreased, 
whereas the remaining parameters were less reduced (Fig. 13a-c). These results 

2 5 show that use of pegylated IFN-a one day post-exposure protects type 1 pneumocytes 

of experimentally infected macaques from SCV infection but is less effective than 
prophylactic use. 

In this study, We have shown that type 1 pneumocytes are the main target cell 
for SCV infection of cynomolgus macaques early in the disease, and that pegylated 

3 0 IFN-a protects type 1 pneumocytes from SCV infection. The first point— type 1 

pneumocytes as the primary target cell— is evident from the extensive presence of 
SCV in type 1 pneumocytes at 4 d.p.i). The temporal sequence of lung lesions that 
emerges when the pathological studies in humans and macaques are viewed together 
is: viral infection and subsequent loss of type 1 pneumocytes; acute DAD, 
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characterized by flooding of alveolar lumina with highly proteinaceous oedema fluid; 
chronic DAD, characterized by type 2 pneumocyte hyperplasia; and, in severe cases, 
extensive pulmonary fibrosis. This sequence of events corresponds to the stereotypic 
alveolar reaction to acute lung injury from a variety of causes (Ware, L.B. and 
5 Matthay, M.A., N. Eng. J. Med. 342, 1334-1349, 2000). 

The second point — that pegylated IFN-a protects type 1 pneumocytes from 
SCV infection — is based on the beneficial effect of pegylated IFN-a therapy initiated 3 
days before SCV inoculation of macaques. In these macaques, SCV infection of type 1 
pneumocytes and severity of lung lesions were significantly reduced (Fig. 13), and 

10 viral excretion was abrogated (Fig. 12). Pegylated IFN-a treatment thus has an 

important effect on the outcome of SARS. Therefore, reduction of the viral load by 
pegylated IFN-a therapy at an early stage of SCV infection helps to prevent serious 
or fatal outcome of SARS associated with pulmonary fibrosis. In addition to potential 
disease mitigation, reduced viral excretion through pegylated IFN-a therapy also has 

15 an epidemiological effect by reducing the spread of SCV in the human population. 
Whether the mechanism of pegylated IFN-a protection is by direct antiviral activity 
or immunostimulatory effects remains to be determined. 

The time interval during which effective post-exposure treatment with 
pegylated IFN-a can be initiated may be longer in humans than in the experimentally 

2 0 infected macaques. This is because the peak of SCV infection in the lungs is at about 

16 d.p.i. in humans — based on an average incubation period of 6 days (Booth, CM. et 
al, JAMA 289, 2801-2809, 2003) and a peak in viral excretion at 10 days after onset of 
symptoms (peiris, J.S.M. et al, Lancet 361, 1767-1772, 2003)— compared to 2 d.p.i. in 
these macaques (Fig. 12). 
25 In conclusion, these studies show that type 1 pneumocytes are the main target 

cell for SCV infection of macaques early in the disease, and that pegylated EFN-a, a 
commercially available antiviral drug, protects these cells from SCV infection. 
Prophylactic or early post-exposure treatment with pegylated IFN-a will help to 
reduce the impact of SCV infection on healthcare workers and others possibly 

3 0 exposed to SCV and to limit the spread of the virus in the human population. 
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1. An isolated essentially mammalian positive-sense single stranded UNA virus 
(SAKS) comprising one or more of the sequences of figure 2. 

2. An isolated positive-sense single stranded RNA virus (SARS) belonging to the 
Coronaviruses and identifiable as phylogenetically corresponding thereto by 
determining a nucleic acid sequence of said virus : and testing it in phylogenetic tree 
analyses wherein maximum likelihood trees are generated using 100 bootstraps and 
3 jumbles and finding it to be more closely phylogenetically corresponding to a virus 
isolate having the sequences as depicted in figure 2 than it is corresponding to a virus 
isolate of BoCo (bovine coronavirus), MHV (murine hepatitis virus), AIBV (avian 
infectious bronchitis virus), PEDV (porcine epidemic diarrhea virus), TGEV 
(transmissible gastroenteritis virus) or 229E (human coronavirus 229E).. 

3. A virus according to claim 1 or 2 wherein said nucleic acid sequence comprises 
an open reading frame (ORE) encoding a viral protein of said virus. 

4. A virus according to claim 3 wherein said open reading frame is selected from 
the group of ORFs encoding the viral replicase, nuclear capsid protein and the spike 
protein. 

5. A virus according to claim 1-4 isolatable from a human with atypical 
pneumonia. 

6. An isolated or recombinant nucleic acid or SARS virus-specific functional 
fragment thereof obtainable from a virus according to anyone of claims 1 to 5. 

7. A vector comprising a nucleic acid according to claim 6. 

8. A host cell comprising a nucleic acid according to claim 6 or a vector according 
to claim 7. • 
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9. An isolated or recombinant proteinaceous molecule or SARS virus-specific 
functional fragment thereof encoded by a nucleic acid according to claim 6. 

10. An antigen comprising a proteinaceous molecule or SARS virus-specific 
5 functional fragment thereof according to claim 9. 

11. An antibody specifically directed against an antigen according to claim 10. 

12. A method for identifying a viral isolate as a SARS virus comprising reacting 
10 said viral isolate or a component thereof with an antibody according to claim 11. 

13. A method for identifying a viral isolate as a SARS virus comprising reacting 
said viral isolate or a component thereof with a nucleic acid according to claim 6. 

15 14. A method for virologically diagnosing a SARS infection of a mammal 

comprising determining in a sample of said mammal the presence of a viral isolate or 
component thereof by reacting said sample with a nucleic acid according to claim 6 or 
an antibody according to claim 11. 

20 15. A method for serologically diagnosing a SARS infection of a mammal 

comprising determining in a sample of said mammal the presence of an antibody 
specifically directed against a SARS virus or component thereof by reacting said 
sample with a proteinaceous molecule or fragment thereof according to claim 9 or an 
antigen according to claim 10. 

25 

16. A diagnostic kit for diagnosing a SARS infection comprising a virus according 
to anyone of claims 1 to 5, a nucleic acid according to claim 6, a proteinaceous 
molecule or fragment thereof according to claim 9, an antigen according to claim 10 
and/or an antibody according to claim 11. 

30 

17. Use of a virus according to any one claims 1 to 5, a nucleic acid according to 
claim 6, a vector according to claim 7, a host cell according to claim 8, a proteinaceous 
molecule or fragment thereof according to claim 9, an antigen according to claim 10, 
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or an antibody according to claim 11 for the production of a pharmaceutical 
composition. 

18. Use according to claim 17 for the production of a pharmaceutical composition 
for the treatment or prevention of a SAES virus infection. 

19. Use according to claim 17 or 18 for the production of a pharmaceutical 
composition for the treatment or prevention of atypical pneumonia. 

20. A pharmaceutical composition comprising a virus according to any one of 
claims 1 to 5, a nucleic acid according to claim 6, a vector according to claim 7, a host 
cell according to claim 8, a proteinaceous molecule or fragment thereof according to 
claim 9, an antigen according to claim 10, or an antibody according to claim 11. 

21. A method for the treatment or prevention of a SAES virus infection comprising 
providing an individual with a pharmaceutical composition according to claim 20. 

22. A method for the treatment or prevention of atypical pneumonia comprising 
providing an individual with a pharmaceutical composition according to claim 20. 

23. A viral rephcase encoded by an RNA sequence comprising the sequences EMC- 
1, EMC-2, EMC-3, EMC-4, EMC-5, EMC-6, EMC-7, EMC- 13 and/or EMC-14, or 
homologues thereof as depicted in figure 2. 

24. A viral spike protein comprising the amino acid depicted as translation 2 with- 
sequence EMC-7 and translation 1 of RDG 1 as depicted in figure 2, or a homologue 
thereof. 

25 ■ A viral nuclear capsid protein encoded by an RNA sequence comprising the 
sequence EMC-8 as depicted in figure 2 or a homologue thereof. 

26. A viral protein encoded by an RNA sequence comprising the sequence EMC-9, 
EMC- 11 and/or EMC- 12 as depicted in figure 2. 
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27. A nucleic acid sequence which comprises one or more of the sequences EMC- 1 
to EMC- 13 as depicted in figure 13 or a nucleic acid sequence which can hybridise 
with any of these sequences under stringent conditions. 

28. Use of interferon for the preparation of a medicament for the 
treatment or prevention of a coronavirus associated disease. 

29. Use according to claim 28 wherein said interferon is interferon 
alpha. 



30. Use according to claim 29, wherein said interferon is interferon- 
alpha 2a. 

31. Use according to claim 29, wherein said interferon is interferqn- 
alpha 2b. ( 

32. Use according to any of claims 28-31, wherein said interferon is pegylated. 

33. Use according to any of claims 28-32, wherein said coronavirus associated 
disease is a disease of animals, preferably vertebrates, more preferably birds or 
mammals, especially humans, ape or rodent. 

34. Use according to claim 33, wherein said disease is a respiratory disease and/or 
gastroenteritis. 

35. Use according to claim 33 or claim 34, wherein said animal is human. 

36. Use according to any of claims 28-35 wherein said coronavirus associated 
disease is a disease caused by HcoV-NL, the feline infectious peritonitis virus (PIPV) 
or hemagglutinating encephalomyelitis virus (HEV) of swine or avian infectious 
bronchitis virus (IBV) or mouse hepatitis virus (MHV). 

37. Use according to any of claims 28-35 wherein said coronavirus associated 
disease is a disease caused by a SARS coronavirus (SARS-CoV). 
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38. Use according to claim 37, wherein said SARS virus is a positive-sense single 
stranded RNA virus (SAES coronavirus) comprising one or more of the sequences of 
figure 2. 

5 

39. Use according to claim 28, wherein said SARS virus is a positive-sense single 
stranded RNA virus (SARS coronavirus) corresponding to GenBank accession no. 
AY274119 or AY278741 or AY338175 or AY338174 or AY322199 or AY 322198 or 
AY322197 or AH013000 or AY322208 or AY322207 or AY 322206 or AY322205 or 

10 AH012999. 

40. A method for the treatment or prevention of a coronavirus associated disease 
in an animal, preferably a vertebrate, more preferably a bird or mammal, especially 
human, ape or rodent, infected with a coronavirus, said method comprising 

15 administrating interferon, to said animal, preferably a vertebrate, more preferably a 
bird or mammal, especially human, ape or rodent, along with a pharmaceutically 
acceptable carrier. 

'41. A method according to any of claims 40 wherein said interferon is 
2 0 administered together with a vaccine, antibody and/or antiviral agent. 

42.' A method according to claim 41, wherein said vaccine, antibody and/or anti- 
viral agent is selected from the group consisting of whole inactivated virus vaccines, 
attenuated vaccines, sub-unit vaccines, recombinant vaccines, antibody for passive 
2 5 immunization, nucleoside analogs such as ribavirin, RNA-dependent RNA 
polymerase inhibitors, protease inhibitors. 
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Figure 2 RNA sequences, implied polypeptides and alignment with one close 
relative 



EMC-1 

5 UUGUAACUGGUGGUCUUGDACAACAGACUUCUCAGUGGUUGDCUAAUCUUUUGGGCACDACUGGUUGARAAAC 
UCAGGCCUAUCUUUGAAUGGAUUGAGGCGAAACUUAGUGCAGGAGUUGAAUDUCUCAAGGAUGCUUGGGAGAU 
OCUCAAADUUCUCADOACAGGUGUUDUDGACAUCGOCAAGGGUCAAAOACAGGUUGCUUCAGAOAACAUCAAG 
GAUUGUGDAAAAUGCUUCAUDGADGUUGUUAACAAGGCACUCGAAAUGUGCAUOGAUCAAGUCACUAUCGCUG 
GCGCAAAGUUGCGAUCACUCAACUUAGGUGAAGUCUUCAUCGCUCAAAGCAAGGGACUUUACCGUCAGUGOAU 

1 0 ACGUGGCAAGGAGCAGCUGCAACUACUCAUGCCUCTOAAGGCACOyVAAGAAGUAACCUUUCU 
UGAAGGUGAUUCACAUGACACAGUACUUACCUCUGAGGAGGUUGUUCUCAAGAACGGUGAA 
CUCGAAGCACUCGAGACGCCCGUUGAUAGCUUCACAAAUGGAGCUAUCGUUGGCACACCAG 
UCUGUGUAAAUGGCGUCAUGCUCUUAGAGAUUAAGGACAAAGAACAAUACUGCGCAUUGUC 
UCCUGGUUUACUGGCUACaAACAAUGUCUUUCGCUUAAAAGGGGGUGCACCAAUUAAAGGU 

1 5 GUAACCUUUGGAGAAGAUACUGUUUGGGAAGUUCAGGGUUACAAGAAUGUGAGAAUCACAU 
UUGAGCUUGAUGAACGUGUUGACAAAGUGCUUAAUGAAAAGUGCUCUGUCUAGACUGUUGA 
AUCCGGUACCGAAGUUACUGAGUUUGCAUGUGUUGUAGCAGAGGCUGUUGUGAAGACUUUA 
CAACCAGUUUCUGAUC 



2 0 Trans /flft'O flNucleotideS 7 to 87 0: Frame 1; 288 a a 

LWLYNRLLSGCLIFWAI.LVEKLRPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVV 
NKA.LEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTSEEWLKNGEL 
EALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQYCALSPGLIATNNVFRLKGGAPIKGVTFGEDTVWEVQGYKNVRITF 
ELDERVDKVLNEKCSVYTVESGTEVTEFACWAEAVVKTLQPVSD 

25 

Alignment 

RNA-directed RNA polymerase (orfla) murine hepatitis virus 
Identities ■ 72/285 (25%), Positives = 118/285 (41%) 

FWALLVEKLRPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCV 228 
F AL V ,+R I EW + L+ + W + L+ G+F + G I + + + V 

FKALGVAWRKITEWFD— LAVDIAASAAGWLCYQ-LVNGLFAVANGVITFVQE-VPELV 693 

KCFIDVVNKALEMCIDQVTIA GAKLRSLNLGEVFIAQSKGLYRQCTRGKEQLQLLMP 399 

K F+D ++ ID ++++ G + V +A SK +Y + K +MP 



LEIKDKEQYCAL SPGLLATNNVFRLKGGAPIKGVTFGEDT-VWEVQGYKNVRITF 735 

K +Q+ + + G+L F G KVF+ V++ + ++ITF 
YMAKCGDQFYPVVVDNDTVGVLDQCWRFPCAG ; KKVEFNDKPKVRKIPSTRKIKITF 857 

ELDERVDKVLNEKCSVYTVESGTEVTEFACWAEAVVKTLQPVSD 870 
LD D VL++ CS + V+ + E VV +AV TL P + 

ALDATFDSVLSKACSEFEVDKDVTLDELLDVVLDAVESTLSPCKE 902 



30 


Query: 


; 49 




Sbjct: 


: 638 




Query: 


: 229 


35 








Sb j ct : 


; 69'4 






: 400. 


40 


Sbjct; 


: 751 




Query: 


: 574 




Sbjct: 


802 


45 








736 




Sbj ct: 


858- 


50 


EMC- 


14 



cauccagcuucuuaaggcagcauaugaaaauudcaauucacaggacaucuuacuugcaccaduguugucagca 
ggcadaduuggugcuaaaccacuucagucuuuacaagogugcgugcagacgguucguacacagguuuauauug 
cagucaaugacaaagcucuuuaugagcagguugucauggauuaucuogadaagcugaagccuagaguggaagc 
55 accuaAacaagaggagccaccaaacacagaagaudccaaaacugaggagaaadcugucguacagaagccuguc 
gaugogaagccaaaaauuaaggccugcauugaogagguuaccacaacacuggaagaaacoaagouucuuacca 
auaagudacucougudugcugauaucaadgguaagcuuuaccaugaoucucagaacaugcuuagaggugaaga 
uaugucduuccuugagaaggaugcaccdoacaugguaggugauguuaucacuaguggugauaucacuuguguu 
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GOA7AUACCCUCCAAAAAGGCUGGUGGCACUACOGAGAUGCUCUCAAGAGCUUUGAAGAAAGOGCCAGUUGAUG 
AGUAUADAACCACGUACCCUGGACAAGGAUGUGCUGGUOAUACACDUGAGGAAGCUAAGACUGCUCUDAAGAA 
AUGCAAAUCUGCAUUUUAUGUACUACCDDCAGAAGCACCUAAUGCUAAGGAAGAGADUCUAGGAACUGUAUCC 
UGGAAUUGAG 

Translation 

Nucleotides 5 to 739: Frame 2; 245 aa 

IQLLKAAyENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAVNDECALYEQWMDYLDNLKPRVEAPKQEEPPN 
10 TEDSKTEEKSVVQKPVDVKPKIKACIDEVTTTLEETKFLTNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAPYMVGDVI 
TSGDITCWIPSKKAGGTTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKTALKKCKSAFYVLPSEAPNAKEEILGTVS 
WN 

Alignment 

15 replicase polyprotein lab Human coronavirus 229E 

Identities = 48/202 (23%), Positives = 83/202 (41%), Gaps = 13/202 (6%)' 
Frame = +2 

20 Query: 8 LLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQTVRT QVYIAVNDKALYEQV 178 

L+KA N Q L P+LS GIFG K SL+V + T '+V++ + + + 

Sbjct: 1371 LIKAYNTINNEQGTPLTPILSCGIFGIKLETSLEVLLDVCNTKEVKVFVYTDTEVCKVKD 1430 

Query: 179 VMDYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQKPVDVKPKIKACIDEVTTTLEETKF 358 
25 + L N++ +VE PK E P V KP V K +++ ++ 

Sbjct: 1431 FVSGLVNVQ-KVEQPKIEPKP- VSVIKVAPKPYRVDGKFSYFTEDLLCVADDKPI 1483 

Query: 359 L — TNKLLLFADINGKLYHDSQNMLRG — EDMSFLEKDAP YMVGDVITSGDITC 508 

+ T+ +h D L + +L +D + K P + +G V+ + . 
30 Sbjct: 1484 VLFTDSMLTLDDRGLALDNALSGVLSAAIKDCVDINKAIPSGNLIKFDIGSW VYM" 153 9 

Query: 509 WIPSKKAGGTTEMLSRALKKV 574 " 

V+PS+K + R +K+ 

Sbjct: 1540 CWPSEKDKHLDHNVQRCTRKL 1561 

35 

EMC -2 

UCGAGAUOUcAUcUUGACGGDGCAGGDUC.UDUCACUDGACAAACUAAAGAGUCUCUUAUCCCUGCGGGAGGUU 
AAGACUAUAAAAGUGUUCACAACUGUGGACAACACUAAUCUCCACACACAGCUUGUGGAUAUGUCUAUGACAD 
. AUGGACAGCAGUUUGGDCCAACAUACUOGGADGGUGCUGAUGUDACAAAAAUUAAACCDCAUGUAAAUCAUGA. 
40 GGGUAAGACUUOCUUUGUACUACCUAGUGAUGACACACUACGUAGDGAAGCUUUCGAGUACUACCAOACUCUO 
GAOGAGAGDUUUCUUGGUAGGUACAUGUCUGCUUDAAACCACACAAAGAAADGGAAA 

Translation 

Nucleotide 2 to 349: Frame 2; 116 aa 

45 

RDFILTVQVLSLDKLKSLLSLREVKTIKVFTTVDNTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLP 
S D DTLRSEAFEY YHTLDES FLGRYMSALNHTKKWK 



5 0 Alignment 

> Bovine Coronavirus RNA-Dependent RNA polymerase 



Identities = 25/90 (27%), Positives = 44/90 (481) 
55 Frame = +2 

Query: 80 IKVFTTVDNTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLPSDDTL 259 

+ + ' TVD N + V + ++G+ G + DG +VTK K . +N++GK FF + + 
.Sbjct: 1565 VDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDNLSSE 1624 

Query: 2 60 RSEAFEYYHTLDESFLGRYMSALNHTKKWK 349 

+A D+ L Y + L + KW+ 

Sbjct: 1625 DLKAVRSSFNFDQKELLAYYNMLVMCSKWQ 1654 
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Fig. 2 Cont. 
EMC13: 

CUGAAGAAGUAGUGGaAAAUCCUACCADACAGAfi.GGAAGUCAUAGAGUGUGACGDGAAAACDACCGAAGUUGU 
AGGCAAUGUCAUACUUAAACCAUCAGAUGAAGGUGUUAAAGUAACACAAGAGUtJAGGUCAUGAGGADCUUAUG 
GCUGCUUAUGUGGAAAACACAAGCAUUACCAUUAAGAAACCUAAUGAGCUUOCACUAGCCUaAGGUUUAAAAA 
CAAUUGCCACUCAUGGUAUUGCUGCAAUUAAUAGUGUDCCUaGGAGOAAAAOUUUGGCUUAUGUCAAACCAUD 
CUUAGGACAAGCAGCAAUUACAACAOCAAAUUGCGCUAAGAGAUOAGCACAAGGUGDGUUUAACAAUUAUAUG 
CCUUAUGUGUUUACAUDAOUGUUCCAAUUGUGUACUUDOACUAAAAGUACCAAUDCDAGAAUUAGAGCUUCAC 
UACCUACAACUAUUGCDAAAAAOAGUGUUAAGAGUGUOGCUAAADOADGUUUGGAUGCCGGCAOUAAUUAUGU 
GAAGUCACCCAAAUDUUCaAAft.UUGUDCACAAUCGCUAUGUGGCOAUUGUUGUUAAGUADUUGCUUAGGUUCU 
CUAAUCUGUGUAACUGCaGCDUOUGGUGOACUCDUAUCUAAUUUUGGUGCDCCUUCUUAUUGUAAUGGCGUUA 
GAGAAUUGUAUCUOflAUOCGUCUARCGUUACUACUAUGGAOUDCDGDGAAGGUUCUUUUCCUUGCAGCAUOUG 
UOUAAGUGGAUUAGACUCCCOUGAUUCUUAUCCAGCUCUUGAAACCAUUCAGGUGACGAUDUCAUCGUACAAG 
CUAGACUUGACAAUUUUAGGUCUGGCCGCDG 



Translation 

>~out: 3 .to 833: Frame 3 277 aa ■ 

EEWE^PTIQKEVIECDVKTTEWGNVILKPSDEGVKVTQELGHEDLMflAYVENTSITIKKPNELSLALGLKTIATHGIAA 
INSVPWSKILAYVKPFLGQAAITTSNCAKRLAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLC 
LDAGINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYCKGVRELYLNSSNVTTMDFCEGSFPCSIC 
LSGLDSLDSYPALETIQVTISSYKLDLTILGLAA 



Alignment 

bovine coronavirus RNA-dependent RNA Polymerase 
Identities = 50/269 (18%), 

Query: 57 KTTEWGNVILKPSDEGVKVTQELGHEDLMAAYVEKTSITIKKPNELSLALGLKTIATH- 233 

K +V +VI+ +K + L D+ ++ ++ N+LS+A+ + TI 

Sbjct: 204 6 KPFKVEDSVIVNDDTSEIKYVKSLSIVDVYDMWLTGCRYVVRTANDLSMAVNVPTIRKFI 2105 

Query: 234 — GIAAINSVPWSKI-LAYVKPFLGQAAITTSNCAKRLAQRVFN--NYMPYVFTLLF 389 

G+ + S+P + L +KP N K + ++ N++ ++F LLF 

Sbjct: 2106 KFGMTLV-SIPIDLLNLREIKPVF NWKAVRNKISACFNFIKWLFVLLFGWI 2156 

Query: 390 — -QLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDAGINYVKSPKFSKLFTIAMW 554 

. +T S++ L . KN+ + + G + + • +W 
Sbjct: 2157 KISADNKVIYTTEVASKLTCKLVALAFKNAFLTFKWSWARGACIIAT IFLLW 220 9 

Query: 555 XXXXXXXXXXXXXVTAAFGVLLSNFGAPSYCNGVRELYLNSSNVTTM ■ 695 

G L P++ .+■+ ++++ T+ 

Sbjct: 2210 FNFIYANVIFSDFYLPKIGFL- PTFVGKIAQWIKSTFSLVTICDLYSIQDVGFKN 2263 

Query: 696 DFCEGSFPCSICLSGLDSLDSYPALETIQ 782 

+C GS C CL+G D LD+Y A++ +Q 
Sbjct: 2264 QYCWGSIACQFCLAGFDMLDNYKAIDWQ 2292 



EMC -3 

GUGGUAAGAUUGUUAGUACUUGUUUUAAACUUAUGCUUAAGGCCACAUUAUUGUGCGUUCU 
UGCUGCAUUAGUUUGUUAUAUCGUUAUGCCAGUACAUACAUUGUCAAUCCAUGAUGGUUAC 
ACAAAUGAAAUCAUUGGUUACAAAGCCAUUCAGGAUGGUGUCACUCGUGACAUCAUUUCUA 
CUGAUGAUUGUUUUGCAAAUAAACAUGCUGGUUUUGACGCAUGGUUUAGCCAGCGUGGUGG 
UUCAUACAAAAAUGACAAAAGCUGCCCUGUAGUAGCUGCUAUCAUUACAAGAGAGAUUGGU 
UUCAUAGUGCGUGGCUUACCGGGUACUGUGCUGAGAGCAAUCAAUGGUGACUUCUUGCAUU 
UCCUACCUCGUGUUUUUAGUGCUGUUGGCAACAUUUGCUACACACCUUCCAAACUCAUUGA 
GUAUAGUGAUUUUGCUACCUCU 



Translation 

Nucleotide 3-449; 149 aa 

GKIVSTCFKLMLKATLLCVLAALVCYIVMPVHTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFD 
AWFSQRGGSYKNDKSCPVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLIEYSDF 
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Fig. 2. Cont. 
Alignment 

> Murine Hepatitis Virus RNA-Dependent RNA polymerase 

Identities = 48/126 (38%), 

Query: 78 YIVMPVHTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQRGG — SY 251 

+ +MP + + D +K I +GV RD+ TD CFANK FD W+ G Y 

Sbjct: 2859 WALMPTYAVHKSDMQLPLYASFKVIDNGVLRDVSVTDACFANKFNQFDQWYESTFGIiAYY 2918 

Query: 252 KNDKSCPVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLIEY 431 

+H K+CPVV. A+I ++IG + +P TVLR LHF+ F+ CYTP I Y 

Sbjct: 2919 RNSKACPVWAVIDQDIGHTLFMVPTTVLR-YGFHVLHFITHAFATDSVQCYTPHMQIPY 2977. 

Query: 432 SDFATS 449 
+F S 

Sbjct: 2978 DNFYAS 2983 



ACAGACAUCAAUCACUUCUGCOGODCDGCAGAGUGGOUDU7AGGAAAAUGGCAUUCCCGUCAGGCAAAGUUGAA 
GGGDGCAUGGUACAAGUAACCUGUGGAACUACAACUCUUAAUGGADUGUGGUOGGADGACACAGUAUACOGUC 
CAAGACADGUCADDUGCACAGCAGAAGACADGCUUAAUCCUAACUAUGAAGAOCQGCUCADUCG.CAAAUCCAA 
CCAUAGCUUUCUUGUUCAGGCDGGCAADGuUCAACUUCGUGUUAUUGGCCAUUCUAOGCAAAAUUGUCDGCUU 
AGGCUUAAAGUUGAUACUUCUAACCCOAAGACACCCAAGUAUAAAUOUGUCCGUAOCCAACCDGGUCAAACAU 
UUUCAGUUCUAGCAUGCUACAAUGGUUCACCAUCOGGUGUUUAUCAGUGUGCCAUGAGACCUAADCAUACCAU 
UAAAGGUUCUUUCCUUAAUGGAUCAUGUGGUAGUGUUGGDODUAACAUUGAuUAUGAUUGCGUGUCUUUCUGC 
UAUAUGCADCAUAUGGAGCDUCCAACAGGAGDACACGCUGGUACUGACUUAGAAGGUAAAUUCUAUGGUCCAU 
UUGUUGACAGACAAACUGCACAGGCUGCAGGUACAGACACAACCAUAACAUUAAAUGOUUUGGCAUGGCUGUA 
UGCUGCUGUUAUCAAUGGUGAUA 

Translation 

Nucleotides 2 to 67 9: Frame 2; 22 6 aa 

QTSITSAVLQSGFRKMaFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKSNHSFLVQAG 
NVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNG3PSGVYQCAMRPNHTIKGSFLNGSCGSVGFNI 
DYDCVSFCYMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGD 

Alignment 

RNA-directed RNA polymerase murine hepatitis virus 

Identities = 122/222 (54%) 

Query: 8 SITSRVLQSGFRKtffiFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNP 187 

S+T++ LQSG KM P+ KVE C+V VT G TLNGLWLDD VYCPRHVIC+4 DM 4P 
Sbjct: 3326 SVTTSFLQSGIVKMVSPTSKVEPCIVSVTYGNMTLNGLWLDDKVYCPRHVICSSADMTDP ' 3385 

Query: 188 NYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTF 3S7 
+Y +LL R++ F V +G + L V+ +■ MQ C L L V NP TPKY F ++PG+TF 

Sbjct: 3386 DYPNLLCRVTSSDFCVMSGRMSLTVMSYQMQGCQLVLTVTLQNPNTPKYSFGWKPGETF 3445 

Query: 36B SVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVH 547 

+VLA YNG P G + +R +HTIKGSFL GSCGSVG+ + D V F YMH +EIi TG H 
Sbjct: 3446 TVLARYNGRPQGAFHVTLRSSHTIKGSFLCGSCGSVGYVLTGDSVRFVYMHQBELSTGCH 3505 

Query: 548 AGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVIN 673 

GTD G FYGP+ D Q Q D T T+NV+AWI1YAA+ N 
Sbjct: 3506' TGTDFSGNFYGPYRDAQWQLPVQDYTQTVNWAWIjYAAIFH 3547 



EMC -5 

Note that this sequence is not fully in frame. 

AGUUGGAAAAGAUGGCAGAUCAGGCUAUGACCCAAAUGUACAAACAGGCAAGAUCUGAGGA 
CAAGAGGGCAAAAGUAACUAGUGCUAUGCAAACAAUGCUCUUCACUAUGCUUAGGAAGCUU 
GAUAAUGAUGCACUUAACAACAUUAUCAACAAUGCGCGUGAUGGUUGUGUUCCACUCAACA 
UCAUACCAUUGACUACAGCAGCCAAACUCAUGGUUGUUGUCCCUGAUtJAUGGDACCUACAA 
GAACACUUGUGAUGGUAACACCUUUACAUAUGCAUCUGCACUCUGGGAAAUCCAGCAAGUU 
GUUGAUGCGGAUAGCAAGAUUGUUCAACUUAGUGAAAUUAACAUGGACAAUUCACCAAAUU 
UGGCUUGGCCCCUUAUUGUUACAGCUCUAAGAGCCAACUCAGCUGUUAAACUACAGAAUAA 
UGAACUGAGUCCAGUAGCACUACGACAGAUGUCCUGUGCGGCUGGUACCACACAAACAGCU 
UGUACUGAUGACAAUGCACUUGCCUACUAUAACAAUUCGAAGGGAGGUAGGUUUGUGCUGG 
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Fig. 2. Cont. 

CAUUACUAUCAGACCACCAAGAUCUCAAAUGGGCUAGAUUCCCUAAGAGUGAUGGUACAGG 
UACAAUUUACACAGAACUGGAACCACCUUGUAGGUUUGUUACAGACACAGCAAAAGGGCCU 
AAAGUGAAAUACUUGUACUUCAUCAAGGCUUAAACAACCUAAAUAGAGGUAUGGUGCUGGG 
CAGUUUAGCUGCUACAGUACGUCUUCAGGCUGGAAAUGCUACAGAAGUaCCUGCCAAUUCA 
ACUGUGCUUUCCUUCUGUGCUUUUGCAGUAGACCCUGCUAAAGCAUAUaAAGGAUUACCUA 
GCAAGUGGAGGACAACCAAUCACCAACUGUGUGAAGAUGUUGUGUACACACACUGGUACAG 
GACAGGCAAUUACUGUAACACCAGAAGCUAACAUGGACCAAGAGUCCUYUGGUGGUGCUUC 
AUGUUGUCUGUAUUGUAGAUGCCACAUUGACCAUCCAAAUCCUAAAGGAYUCUGUGACUUG 
AAAGGUAAGUACGUCCAAAUACCUACCACUUGUGCUAAUGACCCAGUGGGUUUUACACUUA 
GAAACACAGUCUGUACCGUCUGCGGAAUGUGGAAAGGUUAUGGCUGUAGUUGUGACCAACU 
CCGCGAACCCUUGAUGCAGUCUGCGGAUGCAUCAMCGUUUUUAAACGGGUUUGCGGUGUAA 
GUGCAGCCCGUCUUACACCGUGCGGCACAGGCACUAGUACUGAUGUCGUCUACAGGGCUUU 
UGAUAUUUACAACGAAAAAGUUGCUGGUUYUGCAAAGUUCCUAAAAACUAA 

Translation 1 

Nucleotide 3-701 ; 233 aa 

LEPtMADQAMTQMYKQARSEDKKAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVV 
VEDYGTYKNTCDGNTFTYASALWEIQQWDADSKIVQLSEINMDNSPNLAWPLIVTALRANSAVKLQNNELSP 

pSpkwySfika TACTDDNA ^ YYNNSKGGRFVLALLSDHQD 
Translation 2 

PKRVCGVSA-AELTPCGTGTSTDWYRAFDIYNEKVAGXAKFLK 

Alignment 1 of translation 1 sequence 

RNA- Dependent RNA Polymerase: bovine coronavirus 

Identities = 181/413 (43%), 

Query: 3 LEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKXXXXXXXXXXXXXRDGCyPLN 182 

LE+MAD A+T MYK4AR DK++KV SA+QTMLF+M+RK GCVPLN 
Sbjct: 3985 LERMADLALTNMYKEAR1NDKKSKWSALQTMLFSMVRKLDHQALHSILDKAVKGCVPLN 4044 

Query: 183 IIPLTTAAKLMVVVPDYGTYKNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSP 362 

IP A h ++VPD Y D TYA +W+IQ + D+D QL+EI+ D + 
Sbjct: 4045 AIPSLAANTLTIIVPDKSVYDQWDNVYVTYAGNVWQIQTIQDSDGTNKQLNEISDDCN- 4103 

Query: 3 63 NLAWPLIVTALRAN SAVKLQNNEIiSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGG 536 

WPL++ A R N SA LQNNEL P L+ +G QT T YYNNS G 

Sbjct: 4104 WPLVIIANRHNEVSATVLQNNELMPAKLKTQVVNSGPDQTCNTPTQ— CYYNNSNNG 4158 

Query: 537 RFVLALLSDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKA*TT*I 716 

+ V A+LSD LK+ + K DG + EL+PPC+F KG K+KYLYF+K T 

Sbjct: 4159 KIVYAILSDVDGLKYTKILKDDG-NFWLELDPPCKFTVQDVKGLKIKYLYFVKGCNTLA 4217 

Query: 717 EVWCWAV*LLQYVFRL EMLQKYLPIQLCFPSVLLQ*TI>LKHIKDYLASGGQPIT 878 

W V + RL E + LC . SV til D++ GG PI 
Sbjct: 4218 R GWVVGTISSTVRLQAGTATEYASNSSILSLCAFSVDPKKTYL DFIQQGGTPIA 4271 

Query: 879 NCVKMLCTHTGTGQAITVTPEANMDQESXGGASCCLYCRCHIDHPNPKGXCDLKGKYVQI 1058 

NCVKMLC H GTG AITV P+A +Q+S GGAS C+YCR ++HP+ G C L+GK+VQ+ 
Sb 3 ct: 4272 NCVECMLCDHAGTGMAITVKPDATTNQDSYGGASVCIYCRARVEHPDVDG1CKLRGKFVQV 4331 

Query: 1059 PTTCANDPVGFTLRNTVCTVCGMWKGYGCSCDQLREPLMQSADASXFLNGFAV 1217 

P DPV + L + VC VCG W+ CSC + +QS D + FIiNGF V 

Sbjct: 4332 PVG-IKDPVSYVLTHDVCQVCGFWRDGSCSCVS-TDTTVQSKDTN-FLNGFGV 4381 

Alignment 2 of translation 2 sequence 
RNA-directed RNA polymerase (ORF1B) [murine hepatitis virus] 

Identities =24/44 (54%), 

Query: 1199 FKRVCGVSA-ARLTPCGTGTSTDVVYRAFDIYNEKVAGXAKFLK 1327 

FKRV G S ARL PC 4G TDV RAFDI N AG + K 
Sbjct: 18 FKRVRGTSVNARLVPCASGLDTDVQLRAFDICNANRAGIGLYYK 61 
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Note that this sequence is not fully in frame. 

UGACAUCUUACGCGUAUAUGCUAACUUAGGUGAGCGUGUACGCCAAUCAUUAUUAAAGACU 
GUACAAUUCUGCGAUGCUAUGCGUGAUGCAGGCAUUGUAGGCGUACUGACAUUAGAUAAUC 
AGGAUCUUAAUGGGAACUGGUACGAUUUCGGUGAUUUCGUACAAGUAGCACCAGGCUGCGG 
5 AGUUCCUAUUGUGGAUUCAUAUUACUCAUUGCUGAUGCCCAUCCUCACUUUGaCUAGGgCA 
UUGGCUGCUGAGUCCcAUAUGGAUGCUGAUCUCGCAAAaCCACUUAUUAaGUGGgAUUUGC 
UGAAACAUGAlJfJUUACGGAAGAGAGACUUUGUCUCUU^ 

CCAGACAUACCAUCCCAAUUGUAUUAACUGUUUGGAUGAUAGGUGUAUCCUUCAUUGUGCA 
AaCUUUAAUGUGUUAUUUUCUACUGUGUUUcCACCUACAAGUUUUGGACCACUAGUAAGAA 
1 0 AAAUAUUUGUAGAUGGUGUUCCUUCUGUUGUUUCAACUGGAUACCAUUUUCGUGAGUUAGG 
AGUCGUACAUAAUCAGGAUGUAAACUUACAUAGCUCGCGUCUCAGUUUCAAGGAACUUUUA 
GUGUAUGCUGCUGAUCCAGCUAUGCAUGCAGCUUCUGGCAAUUAAUUGCUAGAUAAACGCA 
OJACAUGCUUUUCAGUAGCUCCACUAACAAACAAUGUUGCUUUUCAAACUGUCAAACCCGG 
UAAUUUUAAUAAAGACUUuTJAUGACUUUGCUGUGUCUAAA 

15 

Translation 1 

Nucleotide 2 to 652: Frame 2; 217 aa 

DILRVYANLGERVRQSLLKTVQFCDAMRDAGIVGVLTLDNQDLNGNWYDFGDFVQVAPGCGVPIVDSYYSLLM 
PILTLTRALAAESHMDADLAKPLIKWDLLKHDFTEERLCLFDRYFKYWDQTYHPNC1NCLDDRCILHCANFNV 
20 LFSTVFPPTSFGPLVRKIFVDGVPSVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGN 

• Translation 2 
656 to 772: Frame 2; 39 aa 
LLDKRTTCFSVAPLTNNVAFQTVKPGNFNKDFYDFAVSK 

25 

Alignment 

ORFlab polyprotein Murine hepatitis virus 
Identities = 157/257 (61%), 

3 0 Query: 2 DILRVYftNLGERVRQSLLKTVQFCDAMRDAGIVGVLTLDNQDLNGNWYDFGDFVQVAPGC 181 
DI+ VY LG ++LL T +F DA+ +AG+VGVLTLDNQDL G WYDFGDFV+ PGG 
Sbjct: 4 62 6 DIINVYKKLGPIFNRALLNTAKFADALVEAGLVGVLTLDNQDLYGQWYDFGDFVKTVPGC 4 685 

Query: 182 GVPIVDSYYSLLMPILTLTRALAAESHMDADLAKPLIKWDLLKHDFTEERLCLFDRYFKY 361 
J => GV + DSYYS +MP+LT+ AL +E ++ + +DL+++DFT+ +L LF +YFK+ 

Sbjct: 4 68 6 GVAVADSYYSYMMPMLTMCHALDSELFVNGTYRE FDLVQYDFTDFKLELFTKYFKH 4741 

Query: 362 WDQTYHPNCINCLDDRCILHCANFBVLFSTVFPPTSFGPLVRKIFVDGVPSVVSTGYHFR 541 
. W TYHPN C DDRCI+HCANFN+LFS V P T FGPLVR+IFVDGVP VVS GYH++ 

4U Sbjct: 4742 WSMTYHPNTCECEDDRCIIHCAMFNILB'SMVLPKTCFGPLVRQIFVDGVPFVVSIGYHYK 4801 

Query: 542 ELGWHNQDVNLHSSRLSFKELLVYAADPAMHAASGN*LLDKRTTCFSVAPLTNNVAFQT 721 
ELGVV N DV+ H RLS K+LL+YAADPA+H AS + LLD RT CFSVA +T+ V FQT 

Sbjct: 4802 ELGVVMNMDVDTHRYRLSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKFQT 4861 

Query: 722 VKPGNFNKDFYDFAVSK 772 
VKPGNFN+DFY+F +SK 

Sbjct: 48S2 VKPGNFNQDFYEFILSK 4878 

50 EMC -7 . 

. ACCDUCAGAAUUAUGGUGAAAAUGCUGUDAUACCAMAAGGAAUAADGAUGAAUGUCGCAAAGUAUAGUCAACO 
GUGUCAAUACUUAAAUACACDUACUUUAGCUGDACCCUACAACAUGAGAGUUAUUCACUUUGGUGCUGGCUCD 
GAUAAAGGAGUUGCACCAGGOACAGCUGUGCUCAGACAAUGGCJUGCCAAC0GGCACACUACUUGUCGAUUCAG 
AUCUUAAUGACDUCGUCUCCGACGCAGAUUCUACUUUAAUUGGAGACUGUGCAACAGUACAUACGGCUAAUAA 

5 5 AOGGGACCUOADUAUUAGCGADAUGUAUGACCCDAGGACCAAACAUGUGACAAAAGAGAAUGACUCUAAAGAA 

GGGUDUUUCACUUAUCUGOGUGGAUUOAUAAAGCAAAAACDAGCCCOGGGUGGUDCUADAGCUGUAAAGAUAA 
CAGAGCADUCDDGGAAUGCUGACC0UUACAAGCUUAUGGGCCAUOUCUCADGGUGGACAGCUUUUGUUACAAA 
UGUAAAUGCAUCADCAUCGGAAGCAUUDUUAAUUGGGGCUAACOA0CUUGGCAAGCCGAAGGAACAAAUUGAU 
GGCUAUACCAUGCAUGCUAACUACAUUDUCUGGAGGAACACAAADCCUAUCCAGUUGUCUOCCUAOUCAGUCU 
60 UUGACAUGAGCAAAUUUCCUCUUAAAUDAAGAGGAACUGCaGOAAaGUCUCUOAAGGAGAADCAAAUCAADGA 
UAUGAUUUAUUCUCUUCUGGAAA7AAGGDAGGCUUAUCAUUAGAGAAAACAACAGAGUUGUGGUDUCAAGUGAU 
AOUC0UGUUAACAACUAAACGAACAUGDUUADUUUCUUAUOAUUUCUUACUCUCACUAGUGGUAGDGACCDUG 
ACCGGUGCACCACUUUUGAUGAOGUUCAAGCUCCUAAU0ACACUCAACADACDUCAUCUAUGAGGGGGGUUUA 
CUAUCCUGAUGAAADUUUUAGAUCAGACACOCOUUADUUAACUCAGGAUOUAaUUCUDCCAaUUUAUUCUAAO 

6 5 GUUACAGGGUUUCADACUAUUAAUCAOACGOUUGGCAACCCUGUCAUACCUUUUAAGGAUGGUAUUUADUUUG 

CUGCCACAGAGAAAOCAAAUGUUGUCCGDGGUOGGGUUUCJUGGUaCUACCAUGAACAACAAGUCACAGQCGGU 
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GAUUAUUAUDAACAAUUCUACaAAUGUUGUUAUACGAGCADGUAACaOUGAAUDGUGUGACAACCCUUUCUDU 
GCUGUDUCUAAACCCAUGGGUACACAGACACAUACUADGAUAUaCGAUAAUGCAUUUAAUUGCACDDUCGAGO 
ACAUAUCDGAUGGCUUUUCGCUUGAUGUUUCAGAAAAGUCAGGUAAUUUUAAACACUUACGAGAGUUUGUGUU 
UAAAAAUAAAGADGGGUUaCUCUADGUOUAUAAGGGCUAUCAACCUAUAGAOGUAGUUCGUGAUCUACCUUCU 
5 GGUUUaAACACUUDGAAACCUAUUUUUAAGUOGCCOCUUGGUAUUAACAUUACAAAOUUOAGAGCCAUUCOUA 
CAGCCUUDUCACCUGCUCAAGACAUUUGGGGCACGUCAGCUGCAGCCUAUUUUGDUGGCUAUDUAAAGCCAAC 
UACAUaUAUGCaCAAGUAUGAUGAAAAUGGUACAAUCACAGAOGCUGUUGAUUGOUCUCAAAAUCCACUUGCU 
GAACUCAAAUGCUCQGUUAAGAGCUDUGAGAUUGACAAAGGAAUUUACCAGACCQCUAAUDUCAGGGUDGDUC 
CCUCAGGAGAUGUUGUGAGAUUCCCUAAUAUOACAAACUUGUGUCCDODUGGAGAGGUUUUUAAUGCDACUAA 

10 AUUCCCUDCUGUCUAUGCAUGGGAGAGAAAAAAAAUUOCUAAUUGUGUUGCUGAUUACUCUGUGCUCUACAAC 
UCAACAUOUUUUUCAACCUUDAAGUGCDAUGGCGUUUCUGCCACUAAGUaGAADGADCUUUGCDUCUCCAAaG 
UCUAUGCAGAUUCUUUUGDAGUCAAGGGAGAUGAUGUAAGACAAAUAGCGCCAGGACAAACUGGUGUUAUUGC 
UGAUUAUAADUAUAAAUUGCCAGADGAUDUCAUGGGUDGUGUCCDUGCDDGGAAUACUAGGAACAUUGAUGCO 
ACUUCAACUGGUAAUDAUAAUUAUAAAUAUAGGUAUCUUAGACAUGGCAAGCUUAGGCCCUUUGAGAGAGACA 

15 UAUCUAAUGUGCCUUUGUCCCCUGAUGGCAAACCUUGCACCCCACCUGCaCUUAAUUGUUAOUGGCCAUUAAA 
UGAUUAUGGOUUUUACACGACUACDGGCAUUGGCUACCAACCUOACAGAGUUGUAGUACUUUCUUDUGAACUU 
UUAAAUGCACCGGCCACGGUUUGUGGACCAAAAUUAUCCACUGACCUUAUUAAGAACCAGUGUGUCAAUUUUA 
AUUUUAAaGGACUCACUGGUACUGGUGUGUUAACUCCUUCUUCAAAGAGAUUUCAACCAOUUCAACAAUUUGG 
CCGUGAOGUUUCUGAUUUCACaGAUUCCGUDCGAGAUCCUAAAACAUCUGAAADAUUAGACAUDUCACCUUGG 

2 0 UCUUUUGGGGGUGUAAGUGDAAU0ACACCUGGAACAAAUGCDUCADCUGAAGDUGCUGUOCUAUAUCAAGADG 
UDAACUGCACUGAUGDUUCUACAGCAAOOCAUGCAGADCAACUCACACCAGCUUGGCGCAUAUAUDCDACUGG 
AAACAAUGUAUUCCAGACUCAAGCAGGCUGOCUUAUAGGAGCUGAGCAUGOCGACACUUCUUAOGAGOGCGAC 
AUUCCUAUDGGAGCUGGCADUOGUGCUAGDUACCADACAGUUUCUUUAUDACGDAGUACUAGCCAAAAAUCUA 
DUGUGGCUUAUACUAUGUCDUUAGGUGCUGAUAGUUCAAUUGCUDACaCUAADAACACCAUUGCUAUACCUAC 

25 UAACUUUUCAADUAGCAUUACDACRGRAGUAAUGCCUGUUUCUAUGGCUAAAACCUCCGUAGAUDGUAAUAUG 
UACAUCUGCGGAGAUUCUACUGAAOGUGCUAAUUUGCUUCUCCAAUADGGUAGCUUUUGCACACAACUAAAUC 
GUGCACUCUCGUGGUAUUGCUGCCJGAACAGGAUCGCAACACAC 

Translation 2 

30 

Nucleotides 3 to 818: Frame 3 272 aa (orf lab) 

LQNYGENAVIPQGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDA 
DSTLIGDCATVHTANKWDLIISDMYDPRTKHVTPCENDSKEGFFTYLCGFIKQKLALGGSIAVKITEHSWNADLYKLMGHFS 
WWTRFVTHVNASSSEAFLIGANYLGKPKEQIDGYTMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQIND 
35 MIYSLLEKGRLII RENNRVWS S DI IiVNN 



Translation 2 

4 0 Nucleotide 828 to 3089: Frame 3 756 aa (S protein) 

MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVI 
PFKDGIYFAATEKSNWRGWVFGSTMNNKSQSVIIINNSTNWIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFE 
YISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSPAQD 
IWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIYQTSNFRVVPSGDWRFPNITNLCP 

45 FGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFWKGDDVRQIAPGQTGVI 
ADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTT 
TGIGYQPYRVWLSFELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKT 
SEILDISPCSFGGVSVITPGTNASSEVAVLYQDWCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCL1GAEHVDTSYEC 
DIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAIPTNFSISITTEVMPVSMAKTSVDCNMYICGDST 

50 ECANLLLQYGSFCTQLNRALSWYCC 



Alignment 1 of translation 1 
55 replicase [bovine coronavirus] 
Identities = 183/271 (67%), 

Query: 3. LQNYGENAVIPQGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLR 182 
L NYG+ +P G MMNVAKYTQLCQYLNT TLAVP NMRV+H GAGS +KGVAPG4- AVLR 
60 Sbjct: 6822 LWNYGKPVTLPTGCMMNVAKYTQLCQYLNTTTLAVPVNMRVLHLGAGSEKGVAPGSAVLR 6881 

Query: 183 QWLPfGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDiaYDPRTKHVTKENDSK 362 
QWLP GT+LVD+DL FVSD+ +T GDC T+ +WDLI I S DM YDP TK++ + N SK 
^ Sbjct: 6882 QWLPAGTII1VDNDLYPFVSDSVATYFGDCITI1PFDCQWDLIISDMYDPITKNIGEYNVSK 6941 

Query: 363 EGFFTYLCGFIKQKLALGGSIAVKITEHSWNADLYKLMGHFSWWTAFVTNVNASSSEAFL 542 
+GFFTY+C 1+ KLftLGGS+A+KITE SWNA+LYKLMG+FI+WT F TN NASSSE FL 
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Sbjct: 6942 DGFFTYICHMIRDKIALGGSVAIKITEFSWNAELYKLMGYFAFWTVFCTNMIASSSEGFL 7001 

Query: 543 IGANYLGKPKEQIDGYTMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQ 722 

IG NYLGKPK +IDG MHANY+FWRN+ +YSLFDM+KFPLKL GTAV++L+ +Q 

Sbjct: 7002 IGINYLGKPKVEIDGNVMHANYLFWRNSTVWNGGAYSLFDMAKFP1KLAGTAVINLRADQ 7061 

Query: 723 INDMIYSLLEKGRLIIRENNRWVSSDILVN 815 

INDM+YSLLEKG+L++R+ N+ V D LVN 
Sbjct: 7062 INDMVYSLLEKGKLLVRDTNKEVFVGDSLVN 7092 

Alignment 2 (Spike protein of coronavirus) 

E2 glycoprotein precursor - murine hepatitis virus (strain JHM) ; contains 
spike glycoprotein 

(24%), Positives = 314/798 (39%), Gaps = 48/798 (6%) 

Query: 828 MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSM RGVYYP-DEI 965 

+F+F+L L G D F +Q NY 4 +S RG.YY D + 

Sbjct: 2 LFVFILLLPSCLGYIGD FRCIQTVNYNGNNASAP S I STEAVDVSKGRGT YYVLDRV 57 

Query: 966 FRSDTLYLTQDLFLPF — - — YSNV — TGFHTINHTFGNP — VTPFKDGIYFAATE-KSNV 1118 

+ + TL LT + P Y N+ TG +T++ T+ P + F DGI+ . K+N 

Sbjct: 58 YLNATLLLTG — YYPVDGSNYRNLALTGTNTLSLTKFKPPFLSEFNDGIFAKVQNLKTNT 115 

Query: 1119 VRGW VFGSTMNNKXXXXXXXXXXXXXXXRACNFELCDNPFFAVSKPMGTQTHT 1277 

G V GS N C + +C P+ KP 
Sbjct: 116 PTGATSYFPT1VIGSLFGNTSYTVVLEPYNNIIMASVCTYTICQLPY-TPCKP — 167 

Query: 1278 MIFDNAFNCTFEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVY — -KGYQPIDWR 144 8 

N + +DV KRFF +LY + +G 

Sbjct: 168 NTNGNRVIGFWHTDVKPPICLLK — RM FTFNVNAPWLYFHFYQQGGT FYAYYA 218 

Query: 1449 DLPSGFNTLKPIFKLPLG1NITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYD 1628 

D PS L F + +G +T + + +P T A Y+V L ++ ++ 

Sbjct: 219 DKPSATTFL FSVYIGDILTQYFVLPFICTPTAG— STLAPLYWVTPLLKRQYLFNFN 273 

Query: 1629 ENGT1TDAVDCSQNPLAELKCSVKSFE1DKGIYQTSNFRWPSGDVVR-FPNITNLCPFG 1805 

E G IT AVDC+ + ++E+KC +S G+Y S + V P G V R PN+ +- C 

Sbjct: 274 EKGVITSAVDCASSYISEIKCKTQSLLPSTGVYDLSGYTVQPVGVVYRRVPNLPD-CKIE 332 

Query: 1806 EVFNATKFPSVYAWEREOaSNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADS 1985 

E A PS WER+ NC + S L + C + A+K+ +CF +V D 

Sbjct: 333 EWLTAKSVPSPLNWERRTFQNCNFNLSSLLRYVQAESLSCNNIDASKVYGMCFGSVSVDK 392 

Query: 1986 FVVKGDDVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHG 2165 

F + + G +G + NYK+ C L ++ + T NYN R+G 

Sbjct.: 393 FAIPRSRQIDLQIGNSGFLQTANYKIDTAATSCQLYYSLPKNNVT-INNYNPSSWNRRYG 451 

Query: 2166 KLRPFERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVWLSFELLNA 2345 

+ +ND R + + LLN 

Sb 3 ct: 452 — ~ — FKVND RCQIFANILLNG 468 

Query: 2346 PATVCGPKL STDLIKNQCVN FN FNGLTGTGVLTP- S SKRFQPFQQFGRDVS DFTD 2507 

T C L +T++ CV ++ G+TG GV + + +Q DV+ + 

Sbjct: 469 ' INSGTTCSTDLQLPNTEVATGVCVRYDLYGITGQGVFKF.VKADYYNSWQALLYDVNGNLN 528 

Query: 2508 SVRDPKTSEILDISPCSFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWR 2687 

RD T++ I C G VS + E A+LY+++NC+ V T + + P 

Sbjct: 529 GFRDLTTNKTYTIRSCYSGRVSAAY — HKEAPEPALLYRNINCSYVFTNNISREENPL — 584 

Query: 2688 IYSTGNNVFQTQAGCLIGAEH— VDTSYECDIPIGAGICASYHTVSLLR— STSQK— S 284 6 

■ N F + ' GC++ A++ + C++ +GAG+C Y R ST + + 

Sbjct: 585 NYFDSYLGCVVNADNRTDEALPNCNLRMGAGLCVDYSKSRRARRSVSTGYRLTT 638 

Query: 2847 IVAYTMSLGADSSIAYSN-NTIAIPTNFSISITTEVMPVSMAKTSVDCNMYICGDSTECA 3023 

Y L DS + + IPTNF+I E + + K ++DC ++CGD+ C 

Sbjct: 639 FEPYMPMLVNDSVQSVGGLYEMQIPTNFTIGHHEEFIQIRAPKVTIDCAAFVCGDNAACR 698 
Query: 3024 NLLLQYGSFCTQLNRALS 3077 



ISDOCID: <WO 2004089983A2_L> 



BNS page 65 



WO 2004/089983 



10/36 



PCT/NL2004/000229 



Fig- 2. Cont. 



L++YGSFC +N L+ 
Sbjct: 699 QQLVE YG S FC DNVNAI LN 716 

RDG1 seq 



aDCAAAGCcUUCAAACNUAUGUAR.CACAACAA.CUAAUCAGGGMUGcUGAAAUCHCGSCOUCUGCOAAUCUUGC 
UGCUACUAAAAUGUCUGAGUGUGUUCUUGGACAAOCAAAAAGAGUUGACUUUUGUGGAAAGGGCUACCACCUU 
AUGUCCUUCCCACAAGCAGCCCCGCAUGGUGUUGUCUUCCUACAUGUCACGUAUGOGCCAUCCCAGGAGAGGA 
ACUUCACCACAGCGCCAGCAAUUUGUCAUGAAGGCAAAGCAUACOUCCCOCGUGAAGGUGDOUUOGUGUUUAA 
UGGCACUUCDOGCUDOAUUACACAGAGGAACUUCaOUUCOCCACAAAUAAUUACUACAGACAAUACAUUUGUC 
UCAGGAAAUUGUGAUGDCGUUAUDGGCAOCAUUAACAACACAGUUUAUGAUCCUCDGCAACCUGAGCUUGACU 
CAUUCAAAGAAGAGCUGGACAAGUACUUCAAAAAUCADACAUCACCAGAOGUUGAUCUDGGCGACAUUDCAGG 
CAUUAACGCOUCUGOCGUCAACAUUCAAAAAGAAAUUGACCGCCUCAADGAGGUCGCUAAAAAUUUAAAUGAA 
DCACUCAUUGACCUUCAAGAAUUGGGAAAAUAUGAGCAAUAUADUAAGUGgCCCUGGUACGUCUGGGU 

Translation I 

Nucleotides 3 to 650: Frame 3; 216 aa 

QSLQXYVTQQLIRXAEIXXSANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGWFLHVTYVPSQERNFTTAPAIC 
HEGKAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTFVSGNCDVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPD 
VDLGDISGINASWNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVW 

Translation 2 

Nucleotides 37 to 339: Frame 1? 101 aa 

SGXLKXXLLLILLLLKCLSVFLDNQKELTFVERATTLCPSHKQPRMVLSSYMSRMCHPRRGTSPQRQQFVMKAKHTSLVKV 
FLCLMALLGLLHRGTSFLHK 

Translation 3 

Nucleotides 343 to 576: Frame 1;' 78 aa 

LLQTIHLSQE IVMSLLASLTTQFMILCNLSLTHSKKSWTSTSKI IHHQMLILATFQALTLLSSTFKKKLTASMRSLKI 

Alignment of translation 1 
S glycoprotein [murine hepatitis virus] 
Length = 1.37 6 

Identities = 86/218 (39%), Positives = 129/218 (59%), Gaps = 3/218 (1%) 
Frame = +3 

Query: 6 SLQTYVTQQLIRXAEIXXSANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQAA.PHGWF 185 

+L Y+++QL I SA A K++ECV Q+ R++FCG G H++S Q AP+G+ F 

Sbjcf: 1105 ALNAYISKQLSDSTLIKFSAAQAIEKVNECVKSQTTRINFCGNGNHILSLVQNAPYGLYF 1164 

Query: 186 LHVTYVPSQERNFTTAPAICHEG-KAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTF 362 

+H +YVP+ +P +C G + P+ G FV + W T +++ P+ IT N+ 

Sbjct: 1165 IHFSYVPTSFTTANVSPGLCISGDRGLAPKAGYFVQDDGEWKFTGSSYYYPEPITDKNSV 1224 

Query: 363 VSGNCDWIGIINNTVYDPLQPELDSFKEELDKYFKNHTS — PDVDLGDISGINASWNI 536 

V +C V + .+ PL FKEELDK+FKN TS PD+ L D +N + +++ 

Sbjct: 1225 VMSSGSVNYTKAEEVLLNSSIPNLPDFKEELDKWFKNQTSIAPDLSL-DFEKLNVTFLDL 1283 

Query: 537 QKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVW 650 

E++R+ E K LNES I+L+E+G YE Y+KWPWYVW 
Sbjct: 1284 SDEMNRIQEAIKKLNESYINLKEVGTYEMYVKWPWYVW 1321 



KMC- 8 

AGGCCAAAACAGCGCCGACCCCAAGGUUUACCCAAUAAUACUGCGUCUUGGUUCACAGCUCUCACUCAGCAUG 
GCAAGGAGGAACUUAGAUUCCCUCGAGGCCAGGGCGUOCCAAOCAACACCAAUAGUGGUCCAGADGACCAAAU 
DGGCUACOACCGAAGAGCUACCCGACGAGUUCGUGGUGGUGACGGCAAAAUGAAAGAGCUCAGCCCCAGAUGG 
UACUUCUAUDACCUAGGAACUGGCCCAGAAGCUUCACUUCCCUACGGCGCUAACAAAGAAGGCAUCGUAUGGG 
DUGCAACUGAGGGAGCCUUGAADACACCCAAAGACCACAUUGGCACCCGCAAUCCDAAUAACAAUGOUGCC 

Translation 

Nucleotides I to 363: Frame 1; 121 aa 

RPKQRRPQGLPNNTASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKMKELSPRWYFYYLGTG 
PEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNNXA 
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Alignment 

nucleocapsid protein - bovine coronavirus (strain Mebus) 
5 Identities =55/129 (42%), 

Query: 1 RPKQRRPQGLPNNTA SWFTAIiTQHGK-EELRFPRGQGVPINTNSGPDDQIGYYRR 162 

„, . +EKQ LP+ SWF+ +TQ K +E F GQGVPI + GY+ T? 

iQ Sbjct: 44 QPKQTATSQLPSGGNVVPYYSWFSGITQFQKGKEFEFAEGQGVPIAPGVPATEAKGTOYR 103 

Query: 163 ATRR-VRGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGA-LNTPKDHIG 336 

RR + DG ++L PRWYFYYLGTGP A YG + +G+ WVA+ A +NTP D I 
Sbjct: 104 HNRRSFKTADGNQRQLLPRWYFYYLGTGPHAKDQYGTDIDGVFWVASNQADVNTPAD-IL 162 
15 Query: 337 TRNPNNHXA 363 
Sbjct: 1S3 DRDPSSDEA 171 

EMC- 11 : unknown sequence 

ACUCDDUCCGAACAACUUGADUACAUCGAGUCGaAGAGAGGDGUCUACUGCUGCCGUGACCMGAG^G^ 
25 UUGCCUgGGUUCACUGAGCGCUCUGAUAAGAGCUACGAG^ 

AAaUUGACACUDUCAAAAGGGG/AAUGCCCCAAAGCDUGUGDUUCCUCUUTAACUCAAAAGUCAAAGTC 
CCACGUGDUGAAAAGAAAAAGACUGAGGGUUUCAUGGGGCGUAUACGCUCOGDGUACCCUGU^CMCT 

aggaguguaacaauaugcacuugocuaccuugaogaaauguaaucauugcgaug^ 

CGACUUUCUGAAAGCCACUUGUGAACAUUGUGGCACUGAAAAUUUAGUUAUUGAAG^ 

30 uaccuaccuacuaaugcuguagugaaaaugccauguccugccugdcaagacccagagauugS 
guguugcagauuaucacaaccacocaaacadugaaacucgacuccgcaagggagguaggac^ 
aggcugugugouugccoauguoggcugcoauaauaagcgugccuacogg^^ 
ggcucagggcauacdggcaudacuggogacaauguggagaccuugaaugaggaucuccudgagaua^ 
gugaacguguuaacaduaacaudgduggcgauuuocaoudgaaugaagaggdogccaucayto 

35 cucugcuucuacaagdgccuuuauugacacuauaaagagucougauuacaagucuducS 
uccugcgguaacuauaaaguuaccaagggaaagcccguaaaaggugcuogg^ 

UUUUAACACCACUGQGUGGUUUUCCCaCACAGGCUGCUGGOGUUAUCAGAUCAAOT^ 
UGCAGCAAACCACDCAAUDCCOGAUUUGCAAAGAGCAGCUGUCACCAUACUUGAUGGDAUUOCUGA^S 

40 

Translation of putative open reading frames 

45 >~out: 78 to 1: Frame -2 26 aa 

DFRSCHSYDHRLPYYRSYEEEHCGMQ 
>~out: 59 to 379: Frame 2 107 aa 

iWHDLKSYDLGDELGTDPIEDYEQt 

:qldyieskrgvyccrdheheiaw\ 

-out: 283 to 89: Frame - 

LARACEKIFDAIKRVPIWATEIWDIASDCTSIELTSEFTECTTAMLSVPVLFIIFNGISAKLVT 
>~out: 90 to 614: Frame 3 175 aa 

>~out: 204 to 124: Frame -2 27 aa 

RVTAPPLSSRVSSRSAPLPCLVFQFCS 
>~out: 312 to 208: Frame -2 35 aa 

S SCSERVHI DLPARARKSLMQSRGYPSG PQKLLST 
>~out: 485 to 258: Frame -3 76 aa 

>^out LG ?97 F to™^ 

LLSERSVWPGNFMLMVTAAVDTSLRLDVIKLFGKSAH 
>~out: 364 to 486: Frame 1 41 aa 

NCLGSLSALIRATSTRHPSKLRVPRKLTLSKGECPKACVSS 
>~out: 490 to 401: Frame -1 30 aa 

VKRKHKLWGIPLLKVSIFLALLISKGVWCS 
>~out: 446 to 1483: Frame 2 346 aa 
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HFQKGNAPKLVFPLNSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEASWQTCDFLKATCEHC 
GTENLVIEGPSTCGYLPTNAWKMPCPACQDPEIGPEHSVADYHNHSNIETRLRKGGRTRCFGGCVFAYVGCYNKRAYWVP 
RASADIGSGHTGITGDNVETLNEDLLEILSRERVMINIVGDFHLNEEVAIXLAXFSASTSAFIDTIKSLDYKSFKTIVE5C 
GNYKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAAVTILDGISEQSLRLVDAMVYT 
5 SDLLTNSVIIMAYVTGGLVQQT . 

>~out: 643 to 494: Frame -1 50 aa 

SFIAMITFHQGRQVHIVTLLWRCNRVHRAYTPHETLSLFLFNTWLNDFDF 
>~out: 627 to 511: Frame -2 39 aa 

LHFIKVDKCILLHSCGDATGYTERIRPMKPSVFFFSTRG 
10 >~out: 704 to 612: Frame -3 31 aa 

LNFQCHNVHKWLSESRTSAMKLHRNDYISSR 
>~out: 774 to 631: Frame -2 48 aa 

QAGHGIFTTALVGRYPHVLGPSITKFSVPQCSQVAFRKSHVCHEASSQ 

. >~out: 826 to 737: Frame -1 . 30 aa 

15 WVIICNTMLRSNLWVLTGRTWHFHYSISR 

>~out: 863 to 744: Frame -3 40 aa 

SYLPCGVEFQCLSGCDNLQHYAQVQSLGLDRQDMAFSLQH 
>~out: 756 to 992: Frame 3 79 aa 

KCHVLPVKTQRLDLSIVLQIITTTQTLKLDSAREVGLDVLEAVCLPMLAAIISVPTGFLVLVLILAQAILALLVTMWRP 
20 >~out: 952 to 830: Frame -1 41 aa 

ANI STSTRNPVGTLI IAANIGKHTASKTSSPTSLAES SFNV 

>~out: 1056 to 922: Frame -2 45 aa 

KSPTMLMLTRSRLSISRRSSFKVSTLSPVMPVWPEPISALARGTQ 

>~0Ut: 1237 to 956: Frame -1 94 aa. 

25 SLLSNVPSTFYGLSLGNFIVTAGLNNGFERLVIKTLYSVNKGTCRSREXCQXDGNLFIQMKIANNVNVNTFTTQYLKEILI 

QGLHIVTSNASMA 

>~out: 1140 to 1060: Frame -2 27 aa 

SRLFIVSIKALVEAEXDAKXMATSSFK 
>~out: 1131 to 1205: Frame 3 25 aa 

30 RVLITSLSKPLLSPAVTIKLPRESP 

>~out: 1410 to 1183: Frame -2 76 aa 

TMASTRRNDCSEIPSSMVTAALCKSGIEWFAAS SVRAKIDLITPAACEGKPHSGVKTDLCCPMFQAPFTGFPLVTL 
>~out: 1186 to 1311: Frame 1 42 aa 

SYQGKARKRCLEHWTTEISFNTTWFSLTGCWCYQINFCAHT 
35 >~out: 1283 to 1191: Frame -3 31 aa 

HQQPVRENHTWLKLISVVQCSKHLLRAFPW 
>-out: 1248 to 1457: Frame 3 70 aa 

HHCWFPHRLLVLSDQFLRAHLMQQTTQFL1CKEQLSPYLMVFLNSHYVLSTPWFILQTCSPTVSLLWHM 
>~out: 1381 to 1482: Frame 1 34 aa 

'4 0 TVITSCRRHGLYFRPAHQQCHYYGICNWWSCTTD 

EMC12: unknown sequence ■ 

UGCUUGCUCAUGCUGAAGAGACAAGAAAAUUAAUGCCUAUAUGCAUGGAUGUUAGAGCCAU 
AAUGGCAACCAUCCAACGUAAGUAUAAAGGAAUUAAAAUUCAAGAGGGCAUGGUUGACUAU 

4 5 GGUGUCCGAUUCUUCUUUUAUACUAGUAAAGAGCCUGUAGCUUCUAUUAUUACGAAGCUGA 

ACUCUOTAAAUGAGCCGCUUGUCACAAUGCCAAUUGGUUAUGUGACACAUGGUUUUAAUCU 
UGAAGAGGCUGCGCGCUGUAUGCGUUCUCUUAAAGCUCCUGCCGUAGUGUCAGUAUCAUCA 
CCAGAUGCUGUUACUACAUAUAAUGGAUACCUCAC1JUCGUCAUCAAAGACAUCUGAGGAGC 
ACUUUGUAGAAA.CAGUUUCUUUGGCUGGCUCUUACAGAGAUUGGUCCUAUUCAGGACAGCG 

5 0 UACAGAGUUAGGUGUUGAA 



Translation of putative open reading frames 
>~out: 3 to 446: Frame 3 148 aa 

LAHAEETRKLMPICMDVRAIMATIQRKYKGIKIQEGIVDYGVRFFFYTSKEPVASIITKLNSLNEPLVTMPIGYVTHGFNL 
55 EEAARCMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSGQRTELGVE 

>~out: 100 to 11: Frame -2 30 aa 

ILIPLYLRWMVAIMALTSMHIGINFLVSSA 

>~out: 188 to 33: Frame -I 52 aa 

RVQLRNNRSYRLFTSIKEESDTIVNDALLNFNSFILTLDGCHYGSNIHAYRH 
60 >~out: 64 to 159: Frame 1 32 aa 

WQPSNVS IKELKFKRASLTMVSDS S FI LVKSL 

>~out: 220 to 143: Frame -2 26 aa 

PIGIVTSGSFREFSFVIIEATGSLLV 

>~out: 293 to 192: Frame -1 34 aa 

6 5 HYGRSFKRTHTARSLFKIKTMCHITNWHCDKRLI 

>~out: 397 to 224: Frame -2 58 aa 

EPAKETVSTKCSSDVFDDEVRYPLYWTASGDPTDTTAGALRERIQRAASSRLKPCVT 
>~out: 229 to 288: Frame 1 20 aa 
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HMVLILKRLRAVCVLLKLLP 

>-out: 292 to 372: Frame 1 27 aa 

CQYHHQMLLLHIMDTSLRHQRHLRSTL 

>~out: 444 to 340: Frame -3 35 aa 

QHLTLYAVLNRTNLCKSQPKKLFLQSAPQMSLMTK 

>~out: 416 to 351: Frame -1 22 aa 

IGPISVRASQRNCFYKVLLRCL 

>~out: 365 to 445: Frame 2 27 aa 

GALCRNSFFGWLLQRLVLFRTAYRVRC 

>~out: 376 to 435: Frame 1 20 aa 

KQFLWLALTEIGPIQDSVQS 
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Figure 4. 



Comparison of N-termini of the S proteins of the group 2 coronaviruses 

HCV OC4 3 MFLILLISLPTAFAVIGDL-KCTTVSINDID 

MHV A5 9 MLFVFILFLPSCLGYIGDF-RCIQLVNSNGA 

BCV MFLILLISLPMAFAVIGDL-KCTTVSINDVD 

SARS MF-IFLLFL— TLTSG-SDLDRCTTFDDVQAP 
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Figure 6. 
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Figure 7. 
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Fig 11 
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